Measuring Domain Shift for Deep Learning in Histopathology
The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-wor...
Published in: | IEEE journal of biomedical and health informatics 2021-02, Vol.25 (2), p.325-336 |
---|---|
Main Authors: | Karin Stacke, Gabriel Eilertsen, Jonas Unger, Claes Lundstrom |
Format: | Article |
Language: | English |
cited_by | cdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393 |
---|---|
cites | cdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393 |
container_end_page | 336 |
container_issue | 2 |
container_start_page | 325 |
container_title | IEEE journal of biomedical and health informatics |
container_volume | 25 |
creator | Stacke, Karin ; Eilertsen, Gabriel ; Unger, Jonas ; Lundstrom, Claes |
description | The high capacity of neural networks allows fitting models to data with high precision, but makes generalization to unseen data a challenge. If a domain shift exists, i.e. differences in image statistics between training and test data, care needs to be taken to ensure reliable deployment in real-world scenarios. In digital pathology, domain shift can manifest as differences between whole-slide images, introduced, for example, by differences in the acquisition pipeline between medical centers or over time. In order to harness the great potential presented by deep learning in histopathology, and ensure consistent model behavior, we need a deeper understanding of domain shift and its consequences, such that a model's predictions on new data can be trusted. This work focuses on the internal representation learned by trained convolutional neural networks, and shows how this can be used to formulate a novel measure - the representation shift - for quantifying the magnitude of model-specific domain shift. We perform a study on domain shift in tumor classification of hematoxylin and eosin stained images, by considering different datasets, models, and techniques for preparing data in order to reduce the domain shift. The results show that the proposed measure correlates strongly with the drop in performance when testing a model across a large number of different types of domain shifts, and that it improves on existing techniques for measuring data shift and uncertainty. The proposed measure can reveal how sensitive a model is to domain variations, and can be used to detect new data that a model will have problems generalizing to. We see techniques for measuring, understanding and overcoming domain shift as a crucial step towards reliable use of deep learning in future clinical pathology applications. |
doi_str_mv | 10.1109/JBHI.2020.3032060 |
format | article |
fullrecord | recordid: TN_cdi_crossref_primary_10_1109_JBHI_2020_3032060 ; ieee_id: 9234592 ; pqid: 2487438512 ; PMID: 33085623 ; DOI: 10.1109/JBHI.2020.3032060 ; ISSN: 2168-2194 ; EISSN: 2168-2208 ; CODEN: IJBHA9 ; publisher: United States: IEEE ; date: 2021-02-01 ; pages: 325-336 (12 pages) ; rights: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021 ; peer_reviewed ; oa: free_for_read ; sourcetype: Open Access Repository ; orcid: https://orcid.org/0000-0002-7765-1747 ; https://orcid.org/0000-0002-9368-0177 ; https://orcid.org/0000-0003-1066-3070 ; linktohtml: https://ieeexplore.ieee.org/document/9234592 ; backlink (MEDLINE/PubMed): https://www.ncbi.nlm.nih.gov/pubmed/33085623 ; backlink (Swedish Publication Index): https://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-170816 |
fulltext | fulltext |
identifier | ISSN: 2168-2194 |
ispartof | IEEE journal of biomedical and health informatics, 2021-02, Vol.25 (2), p.325-336 |
issn | 2168-2194 2168-2208 |
language | eng |
recordid | cdi_crossref_primary_10_1109_JBHI_2020_3032060 |
source | IEEE Electronic Library (IEL) Journals |
subjects | Artificial intelligence ; Artificial neural networks ; Biomedical imaging ; Biomedical measurement ; Correlation analysis ; Data models ; Deep learning ; Digital imaging ; domain shift ; Domains ; Feature extraction ; Health care facilities ; Histopathology ; Image acquisition ; Image classification ; Image color analysis ; learning (artificial intelligence) ; Machine learning ; Model testing ; Neural networks ; Pathology ; Representations ; Statistical tests ; unsupervised learning |
title | Measuring Domain Shift for Deep Learning in Histopathology |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T04%3A38%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Measuring%20Domain%20Shift%20for%20Deep%20Learning%20in%20Histopathology&rft.jtitle=IEEE%20journal%20of%20biomedical%20and%20health%20informatics&rft.au=Stacke,%20Karin&rft.date=2021-02-01&rft.volume=25&rft.issue=2&rft.spage=325&rft.epage=336&rft.pages=325-336&rft.issn=2168-2194&rft.eissn=2168-2208&rft.coden=IJBHA9&rft_id=info:doi/10.1109/JBHI.2020.3032060&rft_dat=%3Cproquest_cross%3E2487438512%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c496t-c563feaf7311ab836bc977f74d35fd805735c99337d923872f7392cae9fb79393%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2487438512&rft_id=info:pmid/33085623&rft_ieee_id=9234592&rfr_iscdi=true |
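
The description field above says the representation shift quantifies model-specific domain shift from a network's internal representation, but this record does not give the formula. The sketch below is only one plausible reading, under stated assumptions: each image is summarised by one mean activation per convolutional filter, the source and target distributions of each filter are compared with a 1-D Wasserstein distance, and the distances are averaged over filters. The function names `wasserstein_1d` and `representation_shift` are hypothetical and are not taken from the article.

```python
from bisect import bisect_right


def wasserstein_1d(a, b):
    """1-D Wasserstein-1 distance between two empirical samples.

    Computed as the area between the two empirical CDFs.
    """
    a, b = sorted(a), sorted(b)
    grid = sorted(a + b)
    total = 0.0
    for left, right in zip(grid, grid[1:]):
        cdf_a = bisect_right(a, left) / len(a)
        cdf_b = bisect_right(b, left) / len(b)
        total += abs(cdf_a - cdf_b) * (right - left)
    return total


def representation_shift(source_feats, target_feats):
    """Average per-filter distance between two sets of feature vectors.

    Assumption: each row holds one image's mean activation per filter,
    taken from the same layer of the same trained model.
    """
    n_filters = len(source_feats[0])
    if any(len(row) != n_filters for row in list(source_feats) + list(target_feats)):
        raise ValueError("all feature vectors must have the same length")
    per_filter = [
        wasserstein_1d(
            [row[k] for row in source_feats],
            [row[k] for row in target_feats],
        )
        for k in range(n_filters)
    ]
    return sum(per_filter) / n_filters


if __name__ == "__main__":
    # Toy data: three images, two filters; the target is offset by 0.5.
    source = [[0.0, 1.0], [1.0, 2.0], [2.0, 3.0]]
    target = [[v + 0.5 for v in row] for row in source]
    print(representation_shift(source, source))
    print(representation_shift(source, target))
```

With identical feature sets the sketch returns zero, and a uniform offset of 0.5 in every filter gives a value of about 0.5, since the Wasserstein distance between a distribution and its shifted copy equals the size of the shift. In practice the feature vectors would come from a forward pass of the trained classifier over image patches from each dataset, which is outside the scope of this sketch.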