
Measuring Domain Shift for Deep Learning in Histopathology

The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-wor...

Bibliographic Details
Published in: IEEE journal of biomedical and health informatics 2021-02, Vol.25 (2), p.325-336
Main Authors: Stacke, Karin; Eilertsen, Gabriel; Unger, Jonas; Lundstrom, Claes
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393
cites cdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393
container_end_page 336
container_issue 2
container_start_page 325
container_title IEEE journal of biomedical and health informatics
container_volume 25
creator Stacke, Karin
Eilertsen, Gabriel
Unger, Jonas
Lundstrom, Claes
description The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-world scenarios. In digital pathology, domain shift can be manifested in differences between whole-slide images, introduced by for example differences in acquisition pipeline - between medical centers or over time. In order to harness the great potential presented by deep learning in histopathology, and ensure consistent model behavior, we need a deeper understanding of domain shift and its consequences, such that a model's predictions on new data can be trusted. This work focuses on the internal representation learned by trained convolutional neural networks, and shows how this can be used to formulate a novel measure - the representation shift - for quantifying the magnitude of model-specific domain shift. We perform a study on domain shift in tumor classification of hematoxylin and eosin stained images, by considering different datasets, models, and techniques for preparing data in order to reduce the domain shift. The results show how the proposed measure has a high correlation with drop in performance when testing a model across a large number of different types of domain shifts, and how it improves on existing techniques for measuring data shift and uncertainty. The proposed measure can reveal how sensitive a model is to domain variations, and can be used to detect new data that a model will have problems generalizing to. We see techniques for measuring, understanding and overcoming the domain shift as a crucial step towards reliable use of deep learning in the future clinical pathology applications.
doi_str_mv 10.1109/JBHI.2020.3032060
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_JBHI_2020_3032060</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9234592</ieee_id><sourcerecordid>2487438512</sourcerecordid><originalsourceid>FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393</originalsourceid><addsrcrecordid>eNpdkcFO4zAQhi3EaougD7BCQpG4cEmxPXFscwMKlFVXe9iFq-WkdjFK42AnQrw9jlp6wIexNf83I8_8CP0ieEYIlpe_bxaPM4opngEGikt8gI4oKUVOKRaHX28iiwmaxviK0xEpJcufaAKABSspHKGrP0bHIbh2nc39Rrs2-_fibJ9ZH7K5MV22NDq0o5ykhYu973T_4hu__jhBP6xuopnu7mP0dH_3_3aRL_8-PN5eL_O6kGWf16wEa7TlQIiuBJRVLTm3vFgBsyuBGQdWSwnAV5KC4DSRktbaSFtxCRKOUb7tG99NN1SqC26jw4fy2qm5e75WPqxV4wZF-Dhh4i-2fBf822BirzYu1qZpdGv8EBUtGJRCMC4Sev4NffVDaNM0iRK8AMEITRTZUnXwMQZj918gWI1eqNELNXqhdl6kmrNd56HamNW-4mvzCTjdAs4Ys5fTBgqWwid8Wopp</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2487438512</pqid></control><display><type>article</type><title>Measuring Domain Shift for Deep Learning in Histopathology</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Stacke, Karin ; Eilertsen, Gabriel ; Unger, Jonas ; Lundstrom, Claes</creator><creatorcontrib>Stacke, Karin ; Eilertsen, Gabriel ; Unger, Jonas ; Lundstrom, Claes</creatorcontrib><description>The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-world scenarios. In digital pathology, domain shift can be manifested in differences between whole-slide images, introduced by for example differences in acquisition pipeline - between medical centers or over time. 
In order to harness the great potential presented by deep learning in histopathology, and ensure consistent model behavior, we need a deeper understanding of domain shift and its consequences, such that a model's predictions on new data can be trusted. This work focuses on the internal representation learned by trained convolutional neural networks, and shows how this can be used to formulate a novel measure - the representation shift - for quantifying the magnitude of model-specific domain shift. We perform a study on domain shift in tumor classification of hematoxylin and eosin stained images, by considering different datasets, models, and techniques for preparing data in order to reduce the domain shift. The results show how the proposed measure has a high correlation with drop in performance when testing a model across a large number of different types of domain shifts, and how it improves on existing techniques for measuring data shift and uncertainty. The proposed measure can reveal how sensitive a model is to domain variations, and can be used to detect new data that a model will have problems generalizing to. 
We see techniques for measuring, understanding and overcoming the domain shift as a crucial step towards reliable use of deep learning in the future clinical pathology applications.</description><identifier>ISSN: 2168-2194</identifier><identifier>ISSN: 2168-2208</identifier><identifier>EISSN: 2168-2208</identifier><identifier>DOI: 10.1109/JBHI.2020.3032060</identifier><identifier>PMID: 33085623</identifier><identifier>CODEN: IJBHA9</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Artificial intelligence ; Artificial neural networks ; Biomedical imaging ; Biomedical measurement ; Correlation analysis ; Data models ; Deep learning ; Digital imaging ; domain shift ; Domains ; Feature extraction ; Health care facilities ; Histopathology ; Image acquisition ; Image classification ; Image color analysis ; learning (artificial intelligence) ; Machine learning ; Model testing ; Neural networks ; Pathology ; Representations ; Statistical tests ; unsupervised learning</subject><ispartof>IEEE journal of biomedical and health informatics, 2021-02, Vol.25 (2), p.325-336</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393</citedby><cites>FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393</cites><orcidid>0000-0002-7765-1747 ; 0000-0002-9368-0177 ; 0000-0003-1066-3070</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9234592$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>230,314,776,780,881,27901,27902,54771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33085623$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-170816$$DView record from Swedish Publication Index$$Hfree_for_read</backlink></links><search><creatorcontrib>Stacke, Karin</creatorcontrib><creatorcontrib>Eilertsen, Gabriel</creatorcontrib><creatorcontrib>Unger, Jonas</creatorcontrib><creatorcontrib>Lundstrom, Claes</creatorcontrib><title>Measuring Domain Shift for Deep Learning in Histopathology</title><title>IEEE journal of biomedical and health informatics</title><addtitle>JBHI</addtitle><addtitle>IEEE J Biomed Health Inform</addtitle><description>The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-world scenarios. In digital pathology, domain shift can be manifested in differences between whole-slide images, introduced by for example differences in acquisition pipeline - between medical centers or over time. 
In order to harness the great potential presented by deep learning in histopathology, and ensure consistent model behavior, we need a deeper understanding of domain shift and its consequences, such that a model's predictions on new data can be trusted. This work focuses on the internal representation learned by trained convolutional neural networks, and shows how this can be used to formulate a novel measure - the representation shift - for quantifying the magnitude of model-specific domain shift. We perform a study on domain shift in tumor classification of hematoxylin and eosin stained images, by considering different datasets, models, and techniques for preparing data in order to reduce the domain shift. The results show how the proposed measure has a high correlation with drop in performance when testing a model across a large number of different types of domain shifts, and how it improves on existing techniques for measuring data shift and uncertainty. The proposed measure can reveal how sensitive a model is to domain variations, and can be used to detect new data that a model will have problems generalizing to. 
We see techniques for measuring, understanding and overcoming the domain shift as a crucial step towards reliable use of deep learning in the future clinical pathology applications.</description><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>Biomedical imaging</subject><subject>Biomedical measurement</subject><subject>Correlation analysis</subject><subject>Data models</subject><subject>Deep learning</subject><subject>Digital imaging</subject><subject>domain shift</subject><subject>Domains</subject><subject>Feature extraction</subject><subject>Health care facilities</subject><subject>Histopathology</subject><subject>Image acquisition</subject><subject>Image classification</subject><subject>Image color analysis</subject><subject>learning (artificial intelligence)</subject><subject>Machine learning</subject><subject>Model testing</subject><subject>Neural networks</subject><subject>Pathology</subject><subject>Representations</subject><subject>Statistical tests</subject><subject>unsupervised learning</subject><issn>2168-2194</issn><issn>2168-2208</issn><issn>2168-2208</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNpdkcFO4zAQhi3EaougD7BCQpG4cEmxPXFscwMKlFVXe9iFq-WkdjFK42AnQrw9jlp6wIexNf83I8_8CP0ieEYIlpe_bxaPM4opngEGikt8gI4oKUVOKRaHX28iiwmaxviK0xEpJcufaAKABSspHKGrP0bHIbh2nc39Rrs2-_fibJ9ZH7K5MV22NDq0o5ykhYu973T_4hu__jhBP6xuopnu7mP0dH_3_3aRL_8-PN5eL_O6kGWf16wEa7TlQIiuBJRVLTm3vFgBsyuBGQdWSwnAV5KC4DSRktbaSFtxCRKOUb7tG99NN1SqC26jw4fy2qm5e75WPqxV4wZF-Dhh4i-2fBf822BirzYu1qZpdGv8EBUtGJRCMC4Sev4NffVDaNM0iRK8AMEITRTZUnXwMQZj918gWI1eqNELNXqhdl6kmrNd56HamNW-4mvzCTjdAs4Ys5fTBgqWwid8Wopp</recordid><startdate>20210201</startdate><enddate>20210201</enddate><creator>Stacke, Karin</creator><creator>Eilertsen, Gabriel</creator><creator>Unger, Jonas</creator><creator>Lundstrom, Claes</creator><general>IEEE</general><general>The Institute of Electrical and 
Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>NAPCQ</scope><scope>P64</scope><scope>7X8</scope><scope>ABXSW</scope><scope>ADTPV</scope><scope>AOWAS</scope><scope>D8T</scope><scope>DG8</scope><scope>ZZAVC</scope><orcidid>https://orcid.org/0000-0002-7765-1747</orcidid><orcidid>https://orcid.org/0000-0002-9368-0177</orcidid><orcidid>https://orcid.org/0000-0003-1066-3070</orcidid></search><sort><creationdate>20210201</creationdate><title>Measuring Domain Shift for Deep Learning in Histopathology</title><author>Stacke, Karin ; Eilertsen, Gabriel ; Unger, Jonas ; Lundstrom, Claes</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>Biomedical imaging</topic><topic>Biomedical measurement</topic><topic>Correlation analysis</topic><topic>Data models</topic><topic>Deep learning</topic><topic>Digital imaging</topic><topic>domain shift</topic><topic>Domains</topic><topic>Feature extraction</topic><topic>Health care facilities</topic><topic>Histopathology</topic><topic>Image acquisition</topic><topic>Image classification</topic><topic>Image color analysis</topic><topic>learning (artificial intelligence)</topic><topic>Machine learning</topic><topic>Model 
testing</topic><topic>Neural networks</topic><topic>Pathology</topic><topic>Representations</topic><topic>Statistical tests</topic><topic>unsupervised learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Stacke, Karin</creatorcontrib><creatorcontrib>Eilertsen, Gabriel</creatorcontrib><creatorcontrib>Unger, Jonas</creatorcontrib><creatorcontrib>Lundstrom, Claes</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Xplore</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts 
Professional</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><collection>SWEPUB Linköpings universitet full text</collection><collection>SwePub</collection><collection>SwePub Articles</collection><collection>SWEPUB Freely available online</collection><collection>SWEPUB Linköpings universitet</collection><collection>SwePub Articles full text</collection><jtitle>IEEE journal of biomedical and health informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stacke, Karin</au><au>Eilertsen, Gabriel</au><au>Unger, Jonas</au><au>Lundstrom, Claes</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Measuring Domain Shift for Deep Learning in Histopathology</atitle><jtitle>IEEE journal of biomedical and health informatics</jtitle><stitle>JBHI</stitle><addtitle>IEEE J Biomed Health Inform</addtitle><date>2021-02-01</date><risdate>2021</risdate><volume>25</volume><issue>2</issue><spage>325</spage><epage>336</epage><pages>325-336</pages><issn>2168-2194</issn><issn>2168-2208</issn><eissn>2168-2208</eissn><coden>IJBHA9</coden><abstract>The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-world scenarios. In digital pathology, domain shift can be manifested in differences between whole-slide images, introduced by for example differences in acquisition pipeline - between medical centers or over time. 
In order to harness the great potential presented by deep learning in histopathology, and ensure consistent model behavior, we need a deeper understanding of domain shift and its consequences, such that a model's predictions on new data can be trusted. This work focuses on the internal representation learned by trained convolutional neural networks, and shows how this can be used to formulate a novel measure - the representation shift - for quantifying the magnitude of model-specific domain shift. We perform a study on domain shift in tumor classification of hematoxylin and eosin stained images, by considering different datasets, models, and techniques for preparing data in order to reduce the domain shift. The results show how the proposed measure has a high correlation with drop in performance when testing a model across a large number of different types of domain shifts, and how it improves on existing techniques for measuring data shift and uncertainty. The proposed measure can reveal how sensitive a model is to domain variations, and can be used to detect new data that a model will have problems generalizing to. We see techniques for measuring, understanding and overcoming the domain shift as a crucial step towards reliable use of deep learning in the future clinical pathology applications.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>33085623</pmid><doi>10.1109/JBHI.2020.3032060</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0002-7765-1747</orcidid><orcidid>https://orcid.org/0000-0002-9368-0177</orcidid><orcidid>https://orcid.org/0000-0003-1066-3070</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2168-2194
ispartof IEEE journal of biomedical and health informatics, 2021-02, Vol.25 (2), p.325-336
issn 2168-2194
2168-2208
2168-2208
language eng
recordid cdi_crossref_primary_10_1109_JBHI_2020_3032060
source IEEE Electronic Library (IEL) Journals
subjects Artificial intelligence
Artificial neural networks
Biomedical imaging
Biomedical measurement
Correlation analysis
Data models
Deep learning
Digital imaging
domain shift
Domains
Feature extraction
Health care facilities
Histopathology
Image acquisition
Image classification
Image color analysis
learning (artificial intelligence)
Machine learning
Model testing
Neural networks
Pathology
Representations
Statistical tests
unsupervised learning
title Measuring Domain Shift for Deep Learning in Histopathology
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T04%3A38%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Measuring%20Domain%20Shift%20for%20Deep%20Learning%20in%20Histopathology&rft.jtitle=IEEE%20journal%20of%20biomedical%20and%20health%20informatics&rft.au=Stacke,%20Karin&rft.date=2021-02-01&rft.volume=25&rft.issue=2&rft.spage=325&rft.epage=336&rft.pages=325-336&rft.issn=2168-2194&rft.eissn=2168-2208&rft.coden=IJBHA9&rft_id=info:doi/10.1109/JBHI.2020.3032060&rft_dat=%3Cproquest_cross%3E2487438512%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2487438512&rft_id=info:pmid/33085623&rft_ieee_id=9234592&rfr_iscdi=true
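
The abstract in this record describes a "representation shift" that quantifies model-specific domain shift from the internal representation of a trained convolutional network, but the record does not say how the measure is computed. The sketch below is therefore only an illustration under stated assumptions, not the authors' implementation: it assumes each image is summarised by the mean activation of every channel in one layer, and that the shift is the average one-dimensional Wasserstein distance between the source and target distributions of those means. The function names, the array layout `(n_images, n_channels, height, width)`, and the quantile-based distance approximation are all hypothetical choices made for this example.

```python
import numpy as np


def channel_means(feature_maps: np.ndarray) -> np.ndarray:
    """Reduce activations of shape (n_images, n_channels, H, W)
    to one mean activation per image and channel."""
    return feature_maps.mean(axis=(2, 3))


def wasserstein_1d(a: np.ndarray, b: np.ndarray, n_quantiles: int = 101) -> float:
    """Approximate the 1-D Wasserstein-1 distance between two samples
    by averaging the absolute difference of their empirical quantiles."""
    q = np.linspace(0.0, 1.0, n_quantiles)
    return float(np.mean(np.abs(np.quantile(a, q) - np.quantile(b, q))))


def representation_shift(source_maps: np.ndarray, target_maps: np.ndarray) -> float:
    """Assumed measure: average, over channels, of the distance between the
    source and target distributions of per-image mean activations."""
    src = channel_means(source_maps)
    tgt = channel_means(target_maps)
    per_channel = [
        wasserstein_1d(src[:, c], tgt[:, c]) for c in range(src.shape[1])
    ]
    return float(np.mean(per_channel))


# Synthetic stand-ins for layer activations (not real histopathology data).
rng = np.random.default_rng(0)
source = rng.normal(0.0, 1.0, size=(200, 8, 4, 4))
same_domain = rng.normal(0.0, 1.0, size=(200, 8, 4, 4))
shifted_domain = rng.normal(0.5, 1.0, size=(200, 8, 4, 4))

near = representation_shift(source, same_domain)
far = representation_shift(source, shifted_domain)
print(f"same-domain shift: {near:.3f}, shifted-domain shift: {far:.3f}")
```

In this toy setup the shifted data should score clearly higher than data drawn from the source distribution, which mirrors the abstract's claim that the measure can flag new data a model may have problems generalizing to. With a real network, the synthetic arrays would be replaced by activations extracted from a chosen layer.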