Loading…

D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images

Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It se...

Full description

Saved in:
Bibliographic Details
Published in:Computers in biology and medicine 2024-06, Vol.176, p.108590, Article 108590
Main Authors: Bougourzi, Fares, Dornaika, Fadi, Distante, Cosimo, Taleb-Ahmed, Abdelmalik
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c3460-dfbd790aa52bf0a8c49ffe877ca17cbf27fb5fb7accf71666c85b13b440c727c3
container_end_page
container_issue
container_start_page 108590
container_title Computers in biology and medicine
container_volume 176
creator Bougourzi, Fares
Dornaika, Fadi
Distante, Cosimo
Taleb-Ahmed, Abdelmalik
description Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. •We propose an Efficient Hybrid CNN-Transformer architecture.•Our proposed approach Encoder has Transformer path and Encoders Fusion.•Our approach is designed with Multi-tasks infection-organ-attention Decoders.•The performance is evaluated for Segmenting Bone Metastases and Covid-19 infection.•The proposed Hybrid Encoder has been evaluated for Segmenting Glands and Nuclei.
doi_str_mv 10.1016/j.compbiomed.2024.108590
format article
fullrecord <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_04607102v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0010482524006759</els_id><sourcerecordid>3060766191</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3460-dfbd790aa52bf0a8c49ffe877ca17cbf27fb5fb7accf71666c85b13b440c727c3</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EotvCV0CWuMAhyzh_7ITbdikUaVUu27NlO_auV4m92E5Rvz2O0oLEhZOl59_Mm5mHECawJkDop9Na-fEsrR91vy6hrLPcNh28QCvSsq6ApqpfohUAgaJuy-YCXcZ4AoAaKniNLqqW0QooXaHxS7EPm5TunU6f8d7_EqHHx0cZbI-3d3dFCsJF48OoAxZBHW3SKk1B46zhg3Y6WIWF63GcZBo0jvowapdEst5h63Ae0CoxYDuKg45v0CsjhqjfPr1X6P7rzX57W-x-fPu-3ewKVdUUit7InnUgRFNKA6JVdWeMbhlTgjAlTcmMbIxkQinDCKVUtY0klaxrUKxkqrpCH5e-RzHwc8jm4ZF7YfntZsdnDbINI1A-kMx-WNhz8D8nHRMfbVR6GITTfoq8goYBa_JFM_r-H_Tkp-DyJpnKDSkl3dywXSgVfIxBmz8TEOBzfPzE_8bH5_j4El8uffdkMMn577nwOa8MXC-Aztd7sDrwqKx2Kl855GB47-3_XX4DVu2wyA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3060766191</pqid></control><display><type>article</type><title>D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images</title><source>Elsevier</source><creator>Bougourzi, Fares ; Dornaika, Fadi ; Distante, Cosimo ; Taleb-Ahmed, Abdelmalik</creator><creatorcontrib>Bougourzi, Fares ; Dornaika, Fadi ; Distante, Cosimo ; Taleb-Ahmed, Abdelmalik</creatorcontrib><description>Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. •We propose an Efficient Hybrid CNN-Transformer architecture.•Our proposed approach Encoder has Transformer path and Encoders Fusion.•Our approach is designed with Multi-tasks infection-organ-attention Decoders.•The performance is evaluated for Segmenting Bone Metastases and Covid-19 infection.•The proposed Hybrid Encoder has been evaluated for Segmenting Glands and Nuclei.</description><identifier>ISSN: 0010-4825</identifier><identifier>ISSN: 1879-0534</identifier><identifier>EISSN: 1879-0534</identifier><identifier>DOI: 10.1016/j.compbiomed.2024.108590</identifier><identifier>PMID: 38763066</identifier><language>eng</language><publisher>United States: Elsevier Ltd</publisher><subject>Adaptability ; Bone Metastasis ; Coders ; Computer Science ; Convolutional Neural Network ; COVID-19 ; COVID-19 - diagnostic imaging ; Decoders ; Deep learning ; Humans ; Image analysis ; Image Interpretation, Computer-Assisted - methods ; Image processing ; Image Processing, Computer-Assisted - methods ; Image segmentation ; Lesions ; Machine Learning ; Medical Imaging ; Metastases ; Neural Networks, Computer ; Nuclei ; Organs ; SARS-CoV-2 ; Segmentation ; Transformer ; Unet</subject><ispartof>Computers in biology and medicine, 2024-06, Vol.176, p.108590, Article 108590</ispartof><rights>2024 The Author(s)</rights><rights>Copyright © 2024 The Author(s). Published by Elsevier Ltd.. All rights reserved.</rights><rights>2024. The Author(s)</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3460-dfbd790aa52bf0a8c49ffe877ca17cbf27fb5fb7accf71666c85b13b440c727c3</cites><orcidid>0000-0001-6581-9680 ; 0000-0001-5077-4862 ; 0000-0002-1073-2390 ; 0000-0001-7218-3799 ; 0000-0001-8750-1905</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27903,27904</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38763066$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-04607102$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Bougourzi, Fares</creatorcontrib><creatorcontrib>Dornaika, Fadi</creatorcontrib><creatorcontrib>Distante, Cosimo</creatorcontrib><creatorcontrib>Taleb-Ahmed, Abdelmalik</creatorcontrib><title>D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images</title><title>Computers in biology and medicine</title><addtitle>Comput Biol Med</addtitle><description>Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. •We propose an Efficient Hybrid CNN-Transformer architecture.•Our proposed approach Encoder has Transformer path and Encoders Fusion.•Our approach is designed with Multi-tasks infection-organ-attention Decoders.•The performance is evaluated for Segmenting Bone Metastases and Covid-19 infection.•The proposed Hybrid Encoder has been evaluated for Segmenting Glands and Nuclei.</description><subject>Adaptability</subject><subject>Bone Metastasis</subject><subject>Coders</subject><subject>Computer Science</subject><subject>Convolutional Neural Network</subject><subject>COVID-19</subject><subject>COVID-19 - diagnostic imaging</subject><subject>Decoders</subject><subject>Deep learning</subject><subject>Humans</subject><subject>Image analysis</subject><subject>Image Interpretation, Computer-Assisted - methods</subject><subject>Image processing</subject><subject>Image Processing, Computer-Assisted - methods</subject><subject>Image segmentation</subject><subject>Lesions</subject><subject>Machine Learning</subject><subject>Medical Imaging</subject><subject>Metastases</subject><subject>Neural Networks, Computer</subject><subject>Nuclei</subject><subject>Organs</subject><subject>SARS-CoV-2</subject><subject>Segmentation</subject><subject>Transformer</subject><subject>Unet</subject><issn>0010-4825</issn><issn>1879-0534</issn><issn>1879-0534</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNqFkU9v1DAQxS0EotvCV0CWuMAhyzh_7ITbdikUaVUu27NlO_auV4m92E5Rvz2O0oLEhZOl59_Mm5mHECawJkDop9Na-fEsrR91vy6hrLPcNh28QCvSsq6ApqpfohUAgaJuy-YCXcZ4AoAaKniNLqqW0QooXaHxS7EPm5TunU6f8d7_EqHHx0cZbI-3d3dFCsJF48OoAxZBHW3SKk1B46zhg3Y6WIWF63GcZBo0jvowapdEst5h63Ae0CoxYDuKg45v0CsjhqjfPr1X6P7rzX57W-x-fPu-3ewKVdUUit7InnUgRFNKA6JVdWeMbhlTgjAlTcmMbIxkQinDCKVUtY0klaxrUKxkqrpCH5e-RzHwc8jm4ZF7YfntZsdnDbINI1A-kMx-WNhz8D8nHRMfbVR6GITTfoq8goYBa_JFM_r-H_Tkp-DyJpnKDSkl3dywXSgVfIxBmz8TEOBzfPzE_8bH5_j4El8uffdkMMn577nwOa8MXC-Aztd7sDrwqKx2Kl855GB47-3_XX4DVu2wyA</recordid><startdate>20240601</startdate><enddate>20240601</enddate><creator>Bougourzi, Fares</creator><creator>Dornaika, Fadi</creator><creator>Distante, Cosimo</creator><creator>Taleb-Ahmed, Abdelmalik</creator><general>Elsevier Ltd</general><general>Elsevier Limited</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>K9.</scope><scope>M7Z</scope><scope>NAPCQ</scope><scope>P64</scope><scope>7X8</scope><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-6581-9680</orcidid><orcidid>https://orcid.org/0000-0001-5077-4862</orcidid><orcidid>https://orcid.org/0000-0002-1073-2390</orcidid><orcidid>https://orcid.org/0000-0001-7218-3799</orcidid><orcidid>https://orcid.org/0000-0001-8750-1905</orcidid></search><sort><creationdate>20240601</creationdate><title>D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images</title><author>Bougourzi, Fares ; Dornaika, Fadi ; Distante, Cosimo ; Taleb-Ahmed, Abdelmalik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3460-dfbd790aa52bf0a8c49ffe877ca17cbf27fb5fb7accf71666c85b13b440c727c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Adaptability</topic><topic>Bone Metastasis</topic><topic>Coders</topic><topic>Computer Science</topic><topic>Convolutional Neural Network</topic><topic>COVID-19</topic><topic>COVID-19 - diagnostic imaging</topic><topic>Decoders</topic><topic>Deep learning</topic><topic>Humans</topic><topic>Image analysis</topic><topic>Image Interpretation, Computer-Assisted - methods</topic><topic>Image processing</topic><topic>Image Processing, Computer-Assisted - methods</topic><topic>Image segmentation</topic><topic>Lesions</topic><topic>Machine Learning</topic><topic>Medical Imaging</topic><topic>Metastases</topic><topic>Neural Networks, Computer</topic><topic>Nuclei</topic><topic>Organs</topic><topic>SARS-CoV-2</topic><topic>Segmentation</topic><topic>Transformer</topic><topic>Unet</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bougourzi, Fares</creatorcontrib><creatorcontrib>Dornaika, Fadi</creatorcontrib><creatorcontrib>Distante, Cosimo</creatorcontrib><creatorcontrib>Taleb-Ahmed, Abdelmalik</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biochemistry Abstracts 1</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>Computers in biology and medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bougourzi, Fares</au><au>Dornaika, Fadi</au><au>Distante, Cosimo</au><au>Taleb-Ahmed, Abdelmalik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images</atitle><jtitle>Computers in biology and medicine</jtitle><addtitle>Comput Biol Med</addtitle><date>2024-06-01</date><risdate>2024</risdate><volume>176</volume><spage>108590</spage><pages>108590-</pages><artnum>108590</artnum><issn>0010-4825</issn><issn>1879-0534</issn><eissn>1879-0534</eissn><abstract>Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. •We propose an Efficient Hybrid CNN-Transformer architecture.•Our proposed approach Encoder has Transformer path and Encoders Fusion.•Our approach is designed with Multi-tasks infection-organ-attention Decoders.•The performance is evaluated for Segmenting Bone Metastases and Covid-19 infection.•The proposed Hybrid Encoder has been evaluated for Segmenting Glands and Nuclei.</abstract><cop>United States</cop><pub>Elsevier Ltd</pub><pmid>38763066</pmid><doi>10.1016/j.compbiomed.2024.108590</doi><orcidid>https://orcid.org/0000-0001-6581-9680</orcidid><orcidid>https://orcid.org/0000-0001-5077-4862</orcidid><orcidid>https://orcid.org/0000-0002-1073-2390</orcidid><orcidid>https://orcid.org/0000-0001-7218-3799</orcidid><orcidid>https://orcid.org/0000-0001-8750-1905</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0010-4825
ispartof Computers in biology and medicine, 2024-06, Vol.176, p.108590, Article 108590
issn 0010-4825
1879-0534
1879-0534
language eng
recordid cdi_hal_primary_oai_HAL_hal_04607102v1
source Elsevier
subjects Adaptability
Bone Metastasis
Coders
Computer Science
Convolutional Neural Network
COVID-19
COVID-19 - diagnostic imaging
Decoders
Deep learning
Humans
Image analysis
Image Interpretation, Computer-Assisted - methods
Image processing
Image Processing, Computer-Assisted - methods
Image segmentation
Lesions
Machine Learning
Medical Imaging
Metastases
Neural Networks, Computer
Nuclei
Organs
SARS-CoV-2
Segmentation
Transformer
Unet
title D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T11%3A19%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=D-TrAttUnet:%20Toward%20hybrid%20CNN-transformer%20architecture%20for%20generic%20and%20subtle%20segmentation%20in%20medical%20images&rft.jtitle=Computers%20in%20biology%20and%20medicine&rft.au=Bougourzi,%20Fares&rft.date=2024-06-01&rft.volume=176&rft.spage=108590&rft.pages=108590-&rft.artnum=108590&rft.issn=0010-4825&rft.eissn=1879-0534&rft_id=info:doi/10.1016/j.compbiomed.2024.108590&rft_dat=%3Cproquest_hal_p%3E3060766191%3C/proquest_hal_p%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3460-dfbd790aa52bf0a8c49ffe877ca17cbf27fb5fb7accf71666c85b13b440c727c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3060766191&rft_id=info:pmid/38763066&rfr_iscdi=true