Loading…

Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing

Robust and fast solutions for anatomical object detection and segmentation support the entire clinical workflow from diagnosis, patient stratification, therapy planning, intervention and follow-up. Current state-of-the-art techniques for parsing volumetric medical image data are typically based on m...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on medical imaging 2016-05, Vol.35 (5), p.1217-1228
Main Authors: Ghesu, Florin C., Krubasik, Edward, Georgescu, Bogdan, Singh, Vivek, Yefeng Zheng, Hornegger, Joachim, Comaniciu, Dorin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3
cites cdi_FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3
container_end_page 1228
container_issue 5
container_start_page 1217
container_title IEEE transactions on medical imaging
container_volume 35
creator Ghesu, Florin C.
Krubasik, Edward
Georgescu, Bogdan
Singh, Vivek
Yefeng Zheng
Hornegger, Joachim
Comaniciu, Dorin
description Robust and fast solutions for anatomical object detection and segmentation support the entire clinical workflow from diagnosis, patient stratification, therapy planning, intervention and follow-up. Current state-of-the-art techniques for parsing volumetric medical image data are typically based on machine learning methods that exploit large annotated image databases. Two main challenges need to be addressed, these are the efficiency in scanning high-dimensional parametric spaces and the need for representative image features which require significant efforts of manual engineering. We propose a pipeline for object detection and segmentation in the context of volumetric image parsing, solving a two-step learning problem: anatomical pose estimation and boundary delineation. For this task we introduce Marginal Space Deep Learning (MSDL), a novel framework exploiting both the strengths of efficient object parametrization in hierarchical marginal spaces and the automated feature design of Deep Learning (DL) network architectures. In the 3D context, the application of deep learning systems is limited by the very high complexity of the parametrization. More specifically 9 parameters are necessary to describe a restricted affine transformation in 3D, resulting in a prohibitive amount of billions of scanning hypotheses. The mechanism of marginal space learning provides excellent run-time performance by learning classifiers in clustered, high-probability regions in spaces of gradually increasing dimensionality. To further increase computational efficiency and robustness, in our system we learn sparse adaptive data sampling patterns that automatically capture the structure of the input. Given the object localization, we propose a DL-based active shape model to estimate the non-rigid object boundary. Experimental results are presented on the aortic valve in ultrasound using an extensive dataset of 2891 volumes from 869 patients, showing significant improvements of up to 45.2% over the state-of-the-art. To our knowledge, this is the first successful demonstration of the DL potential to detection and segmentation in full 3D data with parametrized representations.
doi_str_mv 10.1109/TMI.2016.2538802
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_miscellaneous_1816044939</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7426845</ieee_id><sourcerecordid>1816044939</sourcerecordid><originalsourceid>FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3</originalsourceid><addsrcrecordid>eNqNkT1PHDEQhi2UCC5AHylSZClNmr2Mvf5MhwgJJx2CKIDoVj4zvhjtx2HvFvn3GN2FgopqinneV6N5CPnIYM4Y2G_XF4s5B6bmXNbGAN8jMyalqbgUd-_IDLg2FYDiB-RDzg8ATEiw--SAaxDKCDUjvy9cWsfetfTPxnmkPxA3dIku9bFff6dnIUQfsR_pSfJ_44h-nBLSMCR6O7RTh2OKni46t0Z65VIuoSPyPrg24_FuHpKbn2fXp-fV8vLX4vRkWXmh7VhZL41XpnYaEBTTwde8lhzlyoMP5XCuMATjGd5rx70JsArBi3K2E5Z5rA_J123vJg2PE-ax6WL22Laux2HKDSsNSlsF6g0oUyCEre0bUDCKC6NEQb-8Qh-GKZVXFkobzayRlhcKtpRPQ84JQ7NJsXPpX8OgeZbYFInNs8RmJ7FEPu-Kp1WH9y-B_9YK8GkLRER8WWvBy1bWTwA7nlo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1787198592</pqid></control><display><type>article</type><title>Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Ghesu, Florin C. ; Krubasik, Edward ; Georgescu, Bogdan ; Singh, Vivek ; Yefeng Zheng ; Hornegger, Joachim ; Comaniciu, Dorin</creator><creatorcontrib>Ghesu, Florin C. ; Krubasik, Edward ; Georgescu, Bogdan ; Singh, Vivek ; Yefeng Zheng ; Hornegger, Joachim ; Comaniciu, Dorin</creatorcontrib><description>Robust and fast solutions for anatomical object detection and segmentation support the entire clinical workflow from diagnosis, patient stratification, therapy planning, intervention and follow-up. Current state-of-the-art techniques for parsing volumetric medical image data are typically based on machine learning methods that exploit large annotated image databases. Two main challenges need to be addressed, these are the efficiency in scanning high-dimensional parametric spaces and the need for representative image features which require significant efforts of manual engineering. We propose a pipeline for object detection and segmentation in the context of volumetric image parsing, solving a two-step learning problem: anatomical pose estimation and boundary delineation. For this task we introduce Marginal Space Deep Learning (MSDL), a novel framework exploiting both the strengths of efficient object parametrization in hierarchical marginal spaces and the automated feature design of Deep Learning (DL) network architectures. In the 3D context, the application of deep learning systems is limited by the very high complexity of the parametrization. More specifically 9 parameters are necessary to describe a restricted affine transformation in 3D, resulting in a prohibitive amount of billions of scanning hypotheses. The mechanism of marginal space learning provides excellent run-time performance by learning classifiers in clustered, high-probability regions in spaces of gradually increasing dimensionality. To further increase computational efficiency and robustness, in our system we learn sparse adaptive data sampling patterns that automatically capture the structure of the input. Given the object localization, we propose a DL-based active shape model to estimate the non-rigid object boundary. Experimental results are presented on the aortic valve in ultrasound using an extensive dataset of 2891 volumes from 869 patients, showing significant improvements of up to 45.2% over the state-of-the-art. To our knowledge, this is the first successful demonstration of the DL potential to detection and segmentation in full 3D data with parametrized representations.</description><identifier>ISSN: 0278-0062</identifier><identifier>EISSN: 1558-254X</identifier><identifier>DOI: 10.1109/TMI.2016.2538802</identifier><identifier>PMID: 27046846</identifier><identifier>CODEN: ITMID4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Algorithms ; Aortic Valve - diagnostic imaging ; Context ; Databases, Factual ; Deep learning ; Echocardiography, Transesophageal ; Feature extraction ; Humans ; image parsing ; Image Processing, Computer-Assisted - methods ; Image segmentation ; Learning ; Machine Learning ; marginal space learning ; Mathematical models ; Medical imaging ; Neural Networks (Computer) ; Parametrization ; Patients ; Pattern Recognition, Automated - methods ; Robustness ; Segmentation ; Shape ; sparse representations ; Three dimensional ; three-dimensional (3D) object detection and segmentation ; Three-dimensional displays ; Volumetric analysis</subject><ispartof>IEEE transactions on medical imaging, 2016-05, Vol.35 (5), p.1217-1228</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3</citedby><cites>FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3</cites><orcidid>0000-0002-3376-4002</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7426845$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,54771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27046846$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ghesu, Florin C.</creatorcontrib><creatorcontrib>Krubasik, Edward</creatorcontrib><creatorcontrib>Georgescu, Bogdan</creatorcontrib><creatorcontrib>Singh, Vivek</creatorcontrib><creatorcontrib>Yefeng Zheng</creatorcontrib><creatorcontrib>Hornegger, Joachim</creatorcontrib><creatorcontrib>Comaniciu, Dorin</creatorcontrib><title>Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing</title><title>IEEE transactions on medical imaging</title><addtitle>TMI</addtitle><addtitle>IEEE Trans Med Imaging</addtitle><description>Robust and fast solutions for anatomical object detection and segmentation support the entire clinical workflow from diagnosis, patient stratification, therapy planning, intervention and follow-up. Current state-of-the-art techniques for parsing volumetric medical image data are typically based on machine learning methods that exploit large annotated image databases. Two main challenges need to be addressed, these are the efficiency in scanning high-dimensional parametric spaces and the need for representative image features which require significant efforts of manual engineering. We propose a pipeline for object detection and segmentation in the context of volumetric image parsing, solving a two-step learning problem: anatomical pose estimation and boundary delineation. For this task we introduce Marginal Space Deep Learning (MSDL), a novel framework exploiting both the strengths of efficient object parametrization in hierarchical marginal spaces and the automated feature design of Deep Learning (DL) network architectures. In the 3D context, the application of deep learning systems is limited by the very high complexity of the parametrization. More specifically 9 parameters are necessary to describe a restricted affine transformation in 3D, resulting in a prohibitive amount of billions of scanning hypotheses. The mechanism of marginal space learning provides excellent run-time performance by learning classifiers in clustered, high-probability regions in spaces of gradually increasing dimensionality. To further increase computational efficiency and robustness, in our system we learn sparse adaptive data sampling patterns that automatically capture the structure of the input. Given the object localization, we propose a DL-based active shape model to estimate the non-rigid object boundary. Experimental results are presented on the aortic valve in ultrasound using an extensive dataset of 2891 volumes from 869 patients, showing significant improvements of up to 45.2% over the state-of-the-art. To our knowledge, this is the first successful demonstration of the DL potential to detection and segmentation in full 3D data with parametrized representations.</description><subject>Algorithms</subject><subject>Aortic Valve - diagnostic imaging</subject><subject>Context</subject><subject>Databases, Factual</subject><subject>Deep learning</subject><subject>Echocardiography, Transesophageal</subject><subject>Feature extraction</subject><subject>Humans</subject><subject>image parsing</subject><subject>Image Processing, Computer-Assisted - methods</subject><subject>Image segmentation</subject><subject>Learning</subject><subject>Machine Learning</subject><subject>marginal space learning</subject><subject>Mathematical models</subject><subject>Medical imaging</subject><subject>Neural Networks (Computer)</subject><subject>Parametrization</subject><subject>Patients</subject><subject>Pattern Recognition, Automated - methods</subject><subject>Robustness</subject><subject>Segmentation</subject><subject>Shape</subject><subject>sparse representations</subject><subject>Three dimensional</subject><subject>three-dimensional (3D) object detection and segmentation</subject><subject>Three-dimensional displays</subject><subject>Volumetric analysis</subject><issn>0278-0062</issn><issn>1558-254X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNqNkT1PHDEQhi2UCC5AHylSZClNmr2Mvf5MhwgJJx2CKIDoVj4zvhjtx2HvFvn3GN2FgopqinneV6N5CPnIYM4Y2G_XF4s5B6bmXNbGAN8jMyalqbgUd-_IDLg2FYDiB-RDzg8ATEiw--SAaxDKCDUjvy9cWsfetfTPxnmkPxA3dIku9bFff6dnIUQfsR_pSfJ_44h-nBLSMCR6O7RTh2OKni46t0Z65VIuoSPyPrg24_FuHpKbn2fXp-fV8vLX4vRkWXmh7VhZL41XpnYaEBTTwde8lhzlyoMP5XCuMATjGd5rx70JsArBi3K2E5Z5rA_J123vJg2PE-ax6WL22Laux2HKDSsNSlsF6g0oUyCEre0bUDCKC6NEQb-8Qh-GKZVXFkobzayRlhcKtpRPQ84JQ7NJsXPpX8OgeZbYFInNs8RmJ7FEPu-Kp1WH9y-B_9YK8GkLRER8WWvBy1bWTwA7nlo</recordid><startdate>201605</startdate><enddate>201605</enddate><creator>Ghesu, Florin C.</creator><creator>Krubasik, Edward</creator><creator>Georgescu, Bogdan</creator><creator>Singh, Vivek</creator><creator>Yefeng Zheng</creator><creator>Hornegger, Joachim</creator><creator>Comaniciu, Dorin</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>JG9</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>NAPCQ</scope><scope>P64</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-3376-4002</orcidid></search><sort><creationdate>201605</creationdate><title>Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing</title><author>Ghesu, Florin C. ; Krubasik, Edward ; Georgescu, Bogdan ; Singh, Vivek ; Yefeng Zheng ; Hornegger, Joachim ; Comaniciu, Dorin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithms</topic><topic>Aortic Valve - diagnostic imaging</topic><topic>Context</topic><topic>Databases, Factual</topic><topic>Deep learning</topic><topic>Echocardiography, Transesophageal</topic><topic>Feature extraction</topic><topic>Humans</topic><topic>image parsing</topic><topic>Image Processing, Computer-Assisted - methods</topic><topic>Image segmentation</topic><topic>Learning</topic><topic>Machine Learning</topic><topic>marginal space learning</topic><topic>Mathematical models</topic><topic>Medical imaging</topic><topic>Neural Networks (Computer)</topic><topic>Parametrization</topic><topic>Patients</topic><topic>Pattern Recognition, Automated - methods</topic><topic>Robustness</topic><topic>Segmentation</topic><topic>Shape</topic><topic>sparse representations</topic><topic>Three dimensional</topic><topic>three-dimensional (3D) object detection and segmentation</topic><topic>Three-dimensional displays</topic><topic>Volumetric analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Ghesu, Florin C.</creatorcontrib><creatorcontrib>Krubasik, Edward</creatorcontrib><creatorcontrib>Georgescu, Bogdan</creatorcontrib><creatorcontrib>Singh, Vivek</creatorcontrib><creatorcontrib>Yefeng Zheng</creatorcontrib><creatorcontrib>Hornegger, Joachim</creatorcontrib><creatorcontrib>Comaniciu, Dorin</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on medical imaging</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ghesu, Florin C.</au><au>Krubasik, Edward</au><au>Georgescu, Bogdan</au><au>Singh, Vivek</au><au>Yefeng Zheng</au><au>Hornegger, Joachim</au><au>Comaniciu, Dorin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing</atitle><jtitle>IEEE transactions on medical imaging</jtitle><stitle>TMI</stitle><addtitle>IEEE Trans Med Imaging</addtitle><date>2016-05</date><risdate>2016</risdate><volume>35</volume><issue>5</issue><spage>1217</spage><epage>1228</epage><pages>1217-1228</pages><issn>0278-0062</issn><eissn>1558-254X</eissn><coden>ITMID4</coden><abstract>Robust and fast solutions for anatomical object detection and segmentation support the entire clinical workflow from diagnosis, patient stratification, therapy planning, intervention and follow-up. Current state-of-the-art techniques for parsing volumetric medical image data are typically based on machine learning methods that exploit large annotated image databases. Two main challenges need to be addressed, these are the efficiency in scanning high-dimensional parametric spaces and the need for representative image features which require significant efforts of manual engineering. We propose a pipeline for object detection and segmentation in the context of volumetric image parsing, solving a two-step learning problem: anatomical pose estimation and boundary delineation. For this task we introduce Marginal Space Deep Learning (MSDL), a novel framework exploiting both the strengths of efficient object parametrization in hierarchical marginal spaces and the automated feature design of Deep Learning (DL) network architectures. In the 3D context, the application of deep learning systems is limited by the very high complexity of the parametrization. More specifically 9 parameters are necessary to describe a restricted affine transformation in 3D, resulting in a prohibitive amount of billions of scanning hypotheses. The mechanism of marginal space learning provides excellent run-time performance by learning classifiers in clustered, high-probability regions in spaces of gradually increasing dimensionality. To further increase computational efficiency and robustness, in our system we learn sparse adaptive data sampling patterns that automatically capture the structure of the input. Given the object localization, we propose a DL-based active shape model to estimate the non-rigid object boundary. Experimental results are presented on the aortic valve in ultrasound using an extensive dataset of 2891 volumes from 869 patients, showing significant improvements of up to 45.2% over the state-of-the-art. To our knowledge, this is the first successful demonstration of the DL potential to detection and segmentation in full 3D data with parametrized representations.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>27046846</pmid><doi>10.1109/TMI.2016.2538802</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0002-3376-4002</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0278-0062
ispartof IEEE transactions on medical imaging, 2016-05, Vol.35 (5), p.1217-1228
issn 0278-0062
1558-254X
language eng
recordid cdi_proquest_miscellaneous_1816044939
source IEEE Electronic Library (IEL) Journals
subjects Algorithms
Aortic Valve - diagnostic imaging
Context
Databases, Factual
Deep learning
Echocardiography, Transesophageal
Feature extraction
Humans
image parsing
Image Processing, Computer-Assisted - methods
Image segmentation
Learning
Machine Learning
marginal space learning
Mathematical models
Medical imaging
Neural Networks (Computer)
Parametrization
Patients
Pattern Recognition, Automated - methods
Robustness
Segmentation
Shape
sparse representations
Three dimensional
three-dimensional (3D) object detection and segmentation
Three-dimensional displays
Volumetric analysis
title Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T16%3A36%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Marginal%20Space%20Deep%20Learning:%20Efficient%20Architecture%20for%20Volumetric%20Image%20Parsing&rft.jtitle=IEEE%20transactions%20on%20medical%20imaging&rft.au=Ghesu,%20Florin%20C.&rft.date=2016-05&rft.volume=35&rft.issue=5&rft.spage=1217&rft.epage=1228&rft.pages=1217-1228&rft.issn=0278-0062&rft.eissn=1558-254X&rft.coden=ITMID4&rft_id=info:doi/10.1109/TMI.2016.2538802&rft_dat=%3Cproquest_ieee_%3E1816044939%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c479t-9c58c683a70e0617fc32352e5bc0cf27826eff8c1ed7a2c8f0bffc4468a491ce3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1787198592&rft_id=info:pmid/27046846&rft_ieee_id=7426845&rfr_iscdi=true