Loading…

MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging

Here, this work extends our previous research entitled "MemXCT: Memory-centric X-ray CT Reconstruction with Massive Parallelization" that was originally published at SC19 conference (Hidayetoglu et al., 2019) with reproducibility of the computational imaging performance. X-ray computed tom...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on parallel and distributed systems 2021-11, Vol.33 (9)
Main Authors: Hidayetoglu, Mert, Bicer, Tekin, de Gonzalo, Simon Garcia, Ren, Bin, Gursoy, Doga, Kettimuthu, Rajkumar, Foster, Ian T., Hwu, Wen-Mei W.
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue 9
container_start_page
container_title IEEE transactions on parallel and distributed systems
container_volume 33
creator Hidayetoglu, Mert
Bicer, Tekin
de Gonzalo, Simon Garcia
Ren, Bin
Gursoy, Doga
Kettimuthu, Rajkumar
Foster, Ian T.
Hwu, Wen-Mei W.
description Here, this work extends our previous research entitled "MemXCT: Memory-centric X-ray CT Reconstruction with Massive Parallelization" that was originally published at SC19 conference (Hidayetoglu et al., 2019) with reproducibility of the computational imaging performance. X-ray computed tomography (XCT) is regularly used at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be in the order of terabytes. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use an exception rather than the rule. We propose a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a memory-centric iterative reconstruction system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures involving KNL and GPU. MemXCT can reconstruct a large (11Kx11K) mouse brain tomogram in 10 seconds using 4096 KNL nodes (256K cores). The results presented in our original article at the SC19 were based on large-scale supercomputing resources. The MemXCT application was selected for the Student Cluster Competition (SCC) Reproducibility Challenge and evaluated on a variety of cloud computing resources by universities around the world in the SC20 conference. We summarize the results of the top-ranked SCC Reproducibility Challenge teams and identify the most pertinent measures for ensuring the reproducibility of our experiments in this article.
format article
fullrecord <record><control><sourceid>osti</sourceid><recordid>TN_cdi_osti_scitechconnect_1871328</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1871328</sourcerecordid><originalsourceid>FETCH-osti_scitechconnect_18713283</originalsourceid><addsrcrecordid>eNqNi08LgjAchkcUZH--w-ic4NSRdrWiDhGYhDdZc85f6CZuHezT56EP0Ol9HnjeCXIIpZHrkyiYjuyF1I19Es_RwpiX55GQeqGDHlfR5km2xwdhQKotvnUWWvgwC3q0O2cNKLnFTJU4FV2vyzeHJzRgB6wrnLspG3CmWy171tUDvrRMjocVmlWsMWL92yXanI5Zcna1sVAYDlbwmmulBLcFiXYk8KPgr-gL7vlBUA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging</title><source>IEEE Xplore (Online service)</source><creator>Hidayetoglu, Mert ; Bicer, Tekin ; de Gonzalo, Simon Garcia ; Ren, Bin ; Gursoy, Doga ; Kettimuthu, Rajkumar ; Foster, Ian T. ; Hwu, Wen-Mei W.</creator><creatorcontrib>Hidayetoglu, Mert ; Bicer, Tekin ; de Gonzalo, Simon Garcia ; Ren, Bin ; Gursoy, Doga ; Kettimuthu, Rajkumar ; Foster, Ian T. ; Hwu, Wen-Mei W. ; Argonne National Lab. (ANL), Argonne, IL (United States)</creatorcontrib><description>Here, this work extends our previous research entitled "MemXCT: Memory-centric X-ray CT Reconstruction with Massive Parallelization" that was originally published at SC19 conference (Hidayetoglu et al., 2019) with reproducibility of the computational imaging performance. X-ray computed tomography (XCT) is regularly used at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be in the order of terabytes. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use an exception rather than the rule. We propose a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a memory-centric iterative reconstruction system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures involving KNL and GPU. MemXCT can reconstruct a large (11Kx11K) mouse brain tomogram in 10 seconds using 4096 KNL nodes (256K cores). The results presented in our original article at the SC19 were based on large-scale supercomputing resources. The MemXCT application was selected for the Student Cluster Competition (SCC) Reproducibility Challenge and evaluated on a variety of cloud computing resources by universities around the world in the SC20 conference. We summarize the results of the top-ranked SCC Reproducibility Challenge teams and identify the most pertinent measures for ensuring the reproducibility of our experiments in this article.</description><identifier>ISSN: 1045-9219</identifier><identifier>EISSN: 1558-2183</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>cache utilization ; GPU computing ; Knights Landing ; OTHER INSTRUMENTATION ; space-filling curves ; SpMV ; x-ray tomography</subject><ispartof>IEEE transactions on parallel and distributed systems, 2021-11, Vol.33 (9)</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000000284285159 ; 0000000192765075 ; 0000000321295269 ; 0000000325325349 ; 0000000200808101</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids><backlink>$$Uhttps://www.osti.gov/servlets/purl/1871328$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Hidayetoglu, Mert</creatorcontrib><creatorcontrib>Bicer, Tekin</creatorcontrib><creatorcontrib>de Gonzalo, Simon Garcia</creatorcontrib><creatorcontrib>Ren, Bin</creatorcontrib><creatorcontrib>Gursoy, Doga</creatorcontrib><creatorcontrib>Kettimuthu, Rajkumar</creatorcontrib><creatorcontrib>Foster, Ian T.</creatorcontrib><creatorcontrib>Hwu, Wen-Mei W.</creatorcontrib><creatorcontrib>Argonne National Lab. (ANL), Argonne, IL (United States)</creatorcontrib><title>MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging</title><title>IEEE transactions on parallel and distributed systems</title><description>Here, this work extends our previous research entitled "MemXCT: Memory-centric X-ray CT Reconstruction with Massive Parallelization" that was originally published at SC19 conference (Hidayetoglu et al., 2019) with reproducibility of the computational imaging performance. X-ray computed tomography (XCT) is regularly used at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be in the order of terabytes. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use an exception rather than the rule. We propose a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a memory-centric iterative reconstruction system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures involving KNL and GPU. MemXCT can reconstruct a large (11Kx11K) mouse brain tomogram in 10 seconds using 4096 KNL nodes (256K cores). The results presented in our original article at the SC19 were based on large-scale supercomputing resources. The MemXCT application was selected for the Student Cluster Competition (SCC) Reproducibility Challenge and evaluated on a variety of cloud computing resources by universities around the world in the SC20 conference. We summarize the results of the top-ranked SCC Reproducibility Challenge teams and identify the most pertinent measures for ensuring the reproducibility of our experiments in this article.</description><subject>cache utilization</subject><subject>GPU computing</subject><subject>Knights Landing</subject><subject>OTHER INSTRUMENTATION</subject><subject>space-filling curves</subject><subject>SpMV</subject><subject>x-ray tomography</subject><issn>1045-9219</issn><issn>1558-2183</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNi08LgjAchkcUZH--w-ic4NSRdrWiDhGYhDdZc85f6CZuHezT56EP0Ol9HnjeCXIIpZHrkyiYjuyF1I19Es_RwpiX55GQeqGDHlfR5km2xwdhQKotvnUWWvgwC3q0O2cNKLnFTJU4FV2vyzeHJzRgB6wrnLspG3CmWy171tUDvrRMjocVmlWsMWL92yXanI5Zcna1sVAYDlbwmmulBLcFiXYk8KPgr-gL7vlBUA</recordid><startdate>20211123</startdate><enddate>20211123</enddate><creator>Hidayetoglu, Mert</creator><creator>Bicer, Tekin</creator><creator>de Gonzalo, Simon Garcia</creator><creator>Ren, Bin</creator><creator>Gursoy, Doga</creator><creator>Kettimuthu, Rajkumar</creator><creator>Foster, Ian T.</creator><creator>Hwu, Wen-Mei W.</creator><general>IEEE</general><scope>OIOZB</scope><scope>OTOTI</scope><orcidid>https://orcid.org/0000000284285159</orcidid><orcidid>https://orcid.org/0000000192765075</orcidid><orcidid>https://orcid.org/0000000321295269</orcidid><orcidid>https://orcid.org/0000000325325349</orcidid><orcidid>https://orcid.org/0000000200808101</orcidid></search><sort><creationdate>20211123</creationdate><title>MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging</title><author>Hidayetoglu, Mert ; Bicer, Tekin ; de Gonzalo, Simon Garcia ; Ren, Bin ; Gursoy, Doga ; Kettimuthu, Rajkumar ; Foster, Ian T. ; Hwu, Wen-Mei W.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-osti_scitechconnect_18713283</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>cache utilization</topic><topic>GPU computing</topic><topic>Knights Landing</topic><topic>OTHER INSTRUMENTATION</topic><topic>space-filling curves</topic><topic>SpMV</topic><topic>x-ray tomography</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hidayetoglu, Mert</creatorcontrib><creatorcontrib>Bicer, Tekin</creatorcontrib><creatorcontrib>de Gonzalo, Simon Garcia</creatorcontrib><creatorcontrib>Ren, Bin</creatorcontrib><creatorcontrib>Gursoy, Doga</creatorcontrib><creatorcontrib>Kettimuthu, Rajkumar</creatorcontrib><creatorcontrib>Foster, Ian T.</creatorcontrib><creatorcontrib>Hwu, Wen-Mei W.</creatorcontrib><creatorcontrib>Argonne National Lab. (ANL), Argonne, IL (United States)</creatorcontrib><collection>OSTI.GOV - Hybrid</collection><collection>OSTI.GOV</collection><jtitle>IEEE transactions on parallel and distributed systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hidayetoglu, Mert</au><au>Bicer, Tekin</au><au>de Gonzalo, Simon Garcia</au><au>Ren, Bin</au><au>Gursoy, Doga</au><au>Kettimuthu, Rajkumar</au><au>Foster, Ian T.</au><au>Hwu, Wen-Mei W.</au><aucorp>Argonne National Lab. (ANL), Argonne, IL (United States)</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging</atitle><jtitle>IEEE transactions on parallel and distributed systems</jtitle><date>2021-11-23</date><risdate>2021</risdate><volume>33</volume><issue>9</issue><issn>1045-9219</issn><eissn>1558-2183</eissn><abstract>Here, this work extends our previous research entitled "MemXCT: Memory-centric X-ray CT Reconstruction with Massive Parallelization" that was originally published at SC19 conference (Hidayetoglu et al., 2019) with reproducibility of the computational imaging performance. X-ray computed tomography (XCT) is regularly used at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be in the order of terabytes. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use an exception rather than the rule. We propose a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a memory-centric iterative reconstruction system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures involving KNL and GPU. MemXCT can reconstruct a large (11Kx11K) mouse brain tomogram in 10 seconds using 4096 KNL nodes (256K cores). The results presented in our original article at the SC19 were based on large-scale supercomputing resources. The MemXCT application was selected for the Student Cluster Competition (SCC) Reproducibility Challenge and evaluated on a variety of cloud computing resources by universities around the world in the SC20 conference. We summarize the results of the top-ranked SCC Reproducibility Challenge teams and identify the most pertinent measures for ensuring the reproducibility of our experiments in this article.</abstract><cop>United States</cop><pub>IEEE</pub><orcidid>https://orcid.org/0000000284285159</orcidid><orcidid>https://orcid.org/0000000192765075</orcidid><orcidid>https://orcid.org/0000000321295269</orcidid><orcidid>https://orcid.org/0000000325325349</orcidid><orcidid>https://orcid.org/0000000200808101</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1045-9219
ispartof IEEE transactions on parallel and distributed systems, 2021-11, Vol.33 (9)
issn 1045-9219
1558-2183
language eng
recordid cdi_osti_scitechconnect_1871328
source IEEE Xplore (Online service)
subjects cache utilization
GPU computing
Knights Landing
OTHER INSTRUMENTATION
space-filling curves
SpMV
x-ray tomography
title MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T00%3A03%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-osti&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MemXCT:%20Design,%20Optimization,%20Scaling,%20and%20Reproducibility%20of%20X-Ray%20Tomography%20Imaging&rft.jtitle=IEEE%20transactions%20on%20parallel%20and%20distributed%20systems&rft.au=Hidayetoglu,%20Mert&rft.aucorp=Argonne%20National%20Lab.%20(ANL),%20Argonne,%20IL%20(United%20States)&rft.date=2021-11-23&rft.volume=33&rft.issue=9&rft.issn=1045-9219&rft.eissn=1558-2183&rft_id=info:doi/&rft_dat=%3Costi%3E1871328%3C/osti%3E%3Cgrp_id%3Ecdi_FETCH-osti_scitechconnect_18713283%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true