Loading…
Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology
Artificial intelligence and machine learning techniques have the promise to revolutionize the field of digital pathology. However, these models demand considerable amounts of data, while the availability of unbiased training data is limited. Synthetic images can augment existing datasets, to improve...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 7 |
container_issue | |
container_start_page | 1 |
container_title | |
container_volume | 2023 |
creator | Daniel, Nati Aknin, Eliel Larey, Ariel Peretz, Yoni Sela, Guy Fisher, Yael Savir, Yonatan |
description | Artificial intelligence and machine learning techniques have the promise to revolutionize the field of digital pathology. However, these models demand considerable amounts of data, while the availability of unbiased training data is limited. Synthetic images can augment existing datasets, to improve and validate AI algorithms. Yet, controlling the exact distribution of cellular features within them is still challenging. One of the solutions is harnessing conditional generative adversarial networks that take a semantic mask as an input rather than a random noise. Unlike other domains, outlining the exact cellular structure of tissues is hard, and most of the input masks depict regions of cell types. This is also the case for non-small cell lung cancer, the most common type of lung cancer. Deciding whether a patient would receive immunotherapy depends on quantifying regions of stained cells. However, using polygon-based masks introduce inherent artifacts within the synthetic images - due to the mismatch between the polygon size and the single-cell size. In this work, we show that introducing random single-pixel noise with the appropriate spatial frequency into a polygon semantic mask can dramatically improve the quality of the synthetic images. We used our platform to generate synthetic images of immunohistochemistry-treated lung biopsies. We test the quality of the images using a three-fold validation procedure. First, we show that adding the appropriate noise frequency yields 87% of the similarity metrics improvement that is obtained by adding the actual single-cell features. Second, we show that the synthetic images pass the Turing test. Finally, we show that adding these synthetic images to the train set improves AI performance in terms of PD-L1 semantic segmentation performances. Our work suggests a simple and powerful approach for generating synthetic data on demand to unbias limited datasets to improve the algorithms' accuracy and validate their robustness. |
doi_str_mv | 10.1109/EMBC40787.2023.10341042 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>proquest_CHZPO</sourceid><recordid>TN_cdi_pubmed_primary_38083579</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10341042</ieee_id><sourcerecordid>2902933101</sourcerecordid><originalsourceid>FETCH-LOGICAL-i260t-363f83de1d0780001334df15b323da8b56e13b05ff7bc6819be7ca25e9c0fdb63</originalsourceid><addsrcrecordid>eNpNkctOwzAQRQ0SAlT6Bwi8ZNMy9jgPs4PylHgKWFdOMimW0rjELij_wsdiQYtYjXTv0dXMHcYOBYyFAH18cXc2UZDl2ViCxLEAVAKU3GBDnekcE0CpVCY22a5MtRpBCmqHDb23BSSYqERL3GY7mENkM73Lvs4ofBK1_Ipa6kyw7YzfO-uJm7b6L97MzYz8ycq0LQ9vxCeu66gM_LKj9yW1ZR-xRec-yP_YT0vT2NBzV_Pnvo1KsCW_tj64hQlvrnGzfpXLa9fxczuzwTT8cW3usa3aNJ6Gqzlgr5cXL5Pr0e3D1c3k9HZkZQphhCnWOVYkqlgMAAhEVdUiKVBiZfIiSUlgvL-us6JMc6ELykojE9Il1FWR4oAd_ebG3eMdPkzn1pfUNKYlt_RTqUFqRBGTB-xghS6LOVXTRWfnpuun60YjsP8LWCL6s9d_wm-iAIbt</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>2902933101</pqid></control><display><type>conference_proceeding</type><title>Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology</title><source>IEEE Xplore All Conference Series</source><creator>Daniel, Nati ; Aknin, Eliel ; Larey, Ariel ; Peretz, Yoni ; Sela, Guy ; Fisher, Yael ; Savir, Yonatan</creator><creatorcontrib>Daniel, Nati ; Aknin, Eliel ; Larey, Ariel ; Peretz, Yoni ; Sela, Guy ; Fisher, Yael ; Savir, Yonatan</creatorcontrib><description>Artificial intelligence and machine learning techniques have the promise to revolutionize the field of digital pathology. However, these models demand considerable amounts of data, while the availability of unbiased training data is limited. Synthetic images can augment existing datasets, to improve and validate AI algorithms. Yet, controlling the exact distribution of cellular features within them is still challenging. One of the solutions is harnessing conditional generative adversarial networks that take a semantic mask as an input rather than a random noise. Unlike other domains, outlining the exact cellular structure of tissues is hard, and most of the input masks depict regions of cell types. This is also the case for non-small cell lung cancer, the most common type of lung cancer. Deciding whether a patient would receive immunotherapy depends on quantifying regions of stained cells. However, using polygon-based masks introduce inherent artifacts within the synthetic images - due to the mismatch between the polygon size and the single-cell size. In this work, we show that introducing random single-pixel noise with the appropriate spatial frequency into a polygon semantic mask can dramatically improve the quality of the synthetic images. We used our platform to generate synthetic images of immunohistochemistry-treated lung biopsies. We test the quality of the images using a three-fold validation procedure. First, we show that adding the appropriate noise frequency yields 87% of the similarity metrics improvement that is obtained by adding the actual single-cell features. Second, we show that the synthetic images pass the Turing test. Finally, we show that adding these synthetic images to the train set improves AI performance in terms of PD-L1 semantic segmentation performances. Our work suggests a simple and powerful approach for generating synthetic data on demand to unbias limited datasets to improve the algorithms' accuracy and validate their robustness.</description><identifier>EISSN: 2694-0604</identifier><identifier>EISBN: 9798350324471</identifier><identifier>DOI: 10.1109/EMBC40787.2023.10341042</identifier><identifier>PMID: 38083579</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Algorithms ; Artificial Intelligence ; Biopsy image generation ; Carcinoma, Non-Small-Cell Lung ; Deep learning ; Digital pathology ; Humans ; Image translation ; Lung ; Lung cancer ; Lung Neoplasms - diagnostic imaging ; Measurement ; Non-small cell lung carcinoma ; Programmed death-ligand 1 ; Robustness ; Semantics ; Synthetic data</subject><ispartof>2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2023, Vol.2023, p.1-7</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10341042$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,314,776,780,785,786,27903,27904,54533,54910</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10341042$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38083579$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Daniel, Nati</creatorcontrib><creatorcontrib>Aknin, Eliel</creatorcontrib><creatorcontrib>Larey, Ariel</creatorcontrib><creatorcontrib>Peretz, Yoni</creatorcontrib><creatorcontrib>Sela, Guy</creatorcontrib><creatorcontrib>Fisher, Yael</creatorcontrib><creatorcontrib>Savir, Yonatan</creatorcontrib><title>Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology</title><title>2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)</title><addtitle>EMBC</addtitle><addtitle>Annu Int Conf IEEE Eng Med Biol Soc</addtitle><description>Artificial intelligence and machine learning techniques have the promise to revolutionize the field of digital pathology. However, these models demand considerable amounts of data, while the availability of unbiased training data is limited. Synthetic images can augment existing datasets, to improve and validate AI algorithms. Yet, controlling the exact distribution of cellular features within them is still challenging. One of the solutions is harnessing conditional generative adversarial networks that take a semantic mask as an input rather than a random noise. Unlike other domains, outlining the exact cellular structure of tissues is hard, and most of the input masks depict regions of cell types. This is also the case for non-small cell lung cancer, the most common type of lung cancer. Deciding whether a patient would receive immunotherapy depends on quantifying regions of stained cells. However, using polygon-based masks introduce inherent artifacts within the synthetic images - due to the mismatch between the polygon size and the single-cell size. In this work, we show that introducing random single-pixel noise with the appropriate spatial frequency into a polygon semantic mask can dramatically improve the quality of the synthetic images. We used our platform to generate synthetic images of immunohistochemistry-treated lung biopsies. We test the quality of the images using a three-fold validation procedure. First, we show that adding the appropriate noise frequency yields 87% of the similarity metrics improvement that is obtained by adding the actual single-cell features. Second, we show that the synthetic images pass the Turing test. Finally, we show that adding these synthetic images to the train set improves AI performance in terms of PD-L1 semantic segmentation performances. Our work suggests a simple and powerful approach for generating synthetic data on demand to unbias limited datasets to improve the algorithms' accuracy and validate their robustness.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Biopsy image generation</subject><subject>Carcinoma, Non-Small-Cell Lung</subject><subject>Deep learning</subject><subject>Digital pathology</subject><subject>Humans</subject><subject>Image translation</subject><subject>Lung</subject><subject>Lung cancer</subject><subject>Lung Neoplasms - diagnostic imaging</subject><subject>Measurement</subject><subject>Non-small cell lung carcinoma</subject><subject>Programmed death-ligand 1</subject><subject>Robustness</subject><subject>Semantics</subject><subject>Synthetic data</subject><issn>2694-0604</issn><isbn>9798350324471</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpNkctOwzAQRQ0SAlT6Bwi8ZNMy9jgPs4PylHgKWFdOMimW0rjELij_wsdiQYtYjXTv0dXMHcYOBYyFAH18cXc2UZDl2ViCxLEAVAKU3GBDnekcE0CpVCY22a5MtRpBCmqHDb23BSSYqERL3GY7mENkM73Lvs4ofBK1_Ipa6kyw7YzfO-uJm7b6L97MzYz8ycq0LQ9vxCeu66gM_LKj9yW1ZR-xRec-yP_YT0vT2NBzV_Pnvo1KsCW_tj64hQlvrnGzfpXLa9fxczuzwTT8cW3usa3aNJ6Gqzlgr5cXL5Pr0e3D1c3k9HZkZQphhCnWOVYkqlgMAAhEVdUiKVBiZfIiSUlgvL-us6JMc6ELykojE9Il1FWR4oAd_ebG3eMdPkzn1pfUNKYlt_RTqUFqRBGTB-xghS6LOVXTRWfnpuun60YjsP8LWCL6s9d_wm-iAIbt</recordid><startdate>20230101</startdate><enddate>20230101</enddate><creator>Daniel, Nati</creator><creator>Aknin, Eliel</creator><creator>Larey, Ariel</creator><creator>Peretz, Yoni</creator><creator>Sela, Guy</creator><creator>Fisher, Yael</creator><creator>Savir, Yonatan</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>7X8</scope></search><sort><creationdate>20230101</creationdate><title>Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology</title><author>Daniel, Nati ; Aknin, Eliel ; Larey, Ariel ; Peretz, Yoni ; Sela, Guy ; Fisher, Yael ; Savir, Yonatan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i260t-363f83de1d0780001334df15b323da8b56e13b05ff7bc6819be7ca25e9c0fdb63</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Biopsy image generation</topic><topic>Carcinoma, Non-Small-Cell Lung</topic><topic>Deep learning</topic><topic>Digital pathology</topic><topic>Humans</topic><topic>Image translation</topic><topic>Lung</topic><topic>Lung cancer</topic><topic>Lung Neoplasms - diagnostic imaging</topic><topic>Measurement</topic><topic>Non-small cell lung carcinoma</topic><topic>Programmed death-ligand 1</topic><topic>Robustness</topic><topic>Semantics</topic><topic>Synthetic data</topic><toplevel>online_resources</toplevel><creatorcontrib>Daniel, Nati</creatorcontrib><creatorcontrib>Aknin, Eliel</creatorcontrib><creatorcontrib>Larey, Ariel</creatorcontrib><creatorcontrib>Peretz, Yoni</creatorcontrib><creatorcontrib>Sela, Guy</creatorcontrib><creatorcontrib>Fisher, Yael</creatorcontrib><creatorcontrib>Savir, Yonatan</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>MEDLINE - Academic</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Daniel, Nati</au><au>Aknin, Eliel</au><au>Larey, Ariel</au><au>Peretz, Yoni</au><au>Sela, Guy</au><au>Fisher, Yael</au><au>Savir, Yonatan</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology</atitle><btitle>2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)</btitle><stitle>EMBC</stitle><addtitle>Annu Int Conf IEEE Eng Med Biol Soc</addtitle><date>2023-01-01</date><risdate>2023</risdate><volume>2023</volume><spage>1</spage><epage>7</epage><pages>1-7</pages><eissn>2694-0604</eissn><eisbn>9798350324471</eisbn><abstract>Artificial intelligence and machine learning techniques have the promise to revolutionize the field of digital pathology. However, these models demand considerable amounts of data, while the availability of unbiased training data is limited. Synthetic images can augment existing datasets, to improve and validate AI algorithms. Yet, controlling the exact distribution of cellular features within them is still challenging. One of the solutions is harnessing conditional generative adversarial networks that take a semantic mask as an input rather than a random noise. Unlike other domains, outlining the exact cellular structure of tissues is hard, and most of the input masks depict regions of cell types. This is also the case for non-small cell lung cancer, the most common type of lung cancer. Deciding whether a patient would receive immunotherapy depends on quantifying regions of stained cells. However, using polygon-based masks introduce inherent artifacts within the synthetic images - due to the mismatch between the polygon size and the single-cell size. In this work, we show that introducing random single-pixel noise with the appropriate spatial frequency into a polygon semantic mask can dramatically improve the quality of the synthetic images. We used our platform to generate synthetic images of immunohistochemistry-treated lung biopsies. We test the quality of the images using a three-fold validation procedure. First, we show that adding the appropriate noise frequency yields 87% of the similarity metrics improvement that is obtained by adding the actual single-cell features. Second, we show that the synthetic images pass the Turing test. Finally, we show that adding these synthetic images to the train set improves AI performance in terms of PD-L1 semantic segmentation performances. Our work suggests a simple and powerful approach for generating synthetic data on demand to unbias limited datasets to improve the algorithms' accuracy and validate their robustness.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>38083579</pmid><doi>10.1109/EMBC40787.2023.10341042</doi><tpages>7</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2694-0604 |
ispartof | 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2023, Vol.2023, p.1-7 |
issn | 2694-0604 |
language | eng |
recordid | cdi_pubmed_primary_38083579 |
source | IEEE Xplore All Conference Series |
subjects | Algorithms Artificial Intelligence Biopsy image generation Carcinoma, Non-Small-Cell Lung Deep learning Digital pathology Humans Image translation Lung Lung cancer Lung Neoplasms - diagnostic imaging Measurement Non-small cell lung carcinoma Programmed death-ligand 1 Robustness Semantics Synthetic data |
title | Between Generating Noise and Generating Images: Noise in the Correct Frequency Improves the Quality of Synthetic Histopathology Images for Digital Pathology |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T17%3A29%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Between%20Generating%20Noise%20and%20Generating%20Images:%20Noise%20in%20the%20Correct%20Frequency%20Improves%20the%20Quality%20of%20Synthetic%20Histopathology%20Images%20for%20Digital%20Pathology&rft.btitle=2023%2045th%20Annual%20International%20Conference%20of%20the%20IEEE%20Engineering%20in%20Medicine%20&%20Biology%20Society%20(EMBC)&rft.au=Daniel,%20Nati&rft.date=2023-01-01&rft.volume=2023&rft.spage=1&rft.epage=7&rft.pages=1-7&rft.eissn=2694-0604&rft_id=info:doi/10.1109/EMBC40787.2023.10341042&rft.eisbn=9798350324471&rft_dat=%3Cproquest_CHZPO%3E2902933101%3C/proquest_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i260t-363f83de1d0780001334df15b323da8b56e13b05ff7bc6819be7ca25e9c0fdb63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2902933101&rft_id=info:pmid/38083579&rft_ieee_id=10341042&rfr_iscdi=true |