Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis

Summary X‐ray scattering experiments using free electron lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID‐19 viral proteins). XFEL experiments are a challenge to computing in two ways: (i) due to the high cost of running XFELs, a...

Bibliographic Details
Published in: Concurrency and computation 2024-05, Vol.36 (12), p.n/a
Main Authors: Blaschke, Johannes P., Brewster, Aaron S., Paley, Daniel W., Mendez, Derek, Bhowmick, Asmit, Sauter, Nicholas K., Kröger, Wilko, Shankar, Murali, Enders, Bjoern, Bard, Deborah
Format: Article
Language: English
cited_by
cites cdi_FETCH-LOGICAL-c3159-74c43a5d4eab2440ce04c4e80f5bbc9ff8409eaef749fd1a0323452bc0a4053
container_end_page n/a
container_issue 12
container_start_page
container_title Concurrency and computation
container_volume 36
creator Blaschke, Johannes P.
Brewster, Aaron S.
Paley, Daniel W.
Mendez, Derek
Bhowmick, Asmit
Sauter, Nicholas K.
Kröger, Wilko
Shankar, Murali
Enders, Bjoern
Bard, Deborah
description Summary X‐ray scattering experiments using free electron lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID‐19 viral proteins). XFEL experiments are a challenge to computing in two ways: (i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; (ii) data‐collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where they are analyzed using the software package CCTBX. We achieved real‐time data analysis with a turnaround time from data acquisition to full molecular reconstruction in as little as 10 min, which is sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS‐NERSC interoperability, we achieved a data analysis rate that matches the data acquisition rate. Completing data analysis within 10 min is a first for XFEL experiments and an important milestone if we are to keep up with data‐collection trends.
doi_str_mv 10.1002/cpe.8019
format article
fullrecord <record><control><sourceid>proquest_osti_</sourceid><recordid>TN_cdi_osti_scitechconnect_2322459</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3047706910</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3159-74c43a5d4eab2440ce04c4e80f5bbc9ff8409eaef749fd1a0323452bc0a4053</originalsourceid><addsrcrecordid>eNp1kM9Kw0AQhxdRsFbBR1j04iV19k-axlsJqQpBpfXgbdlsJhiJScxu0d58BJ_RJ3FrRPDgaX4MHz9mPkKOGUwYAD83HU5mwOIdMmKh4AFMhdz9zXy6Tw6sfQJgDAQbkXqJuv58_3DVM9KHRZrRQjtNdaPrja0s1Y6usnniFwW9SZer5ILOqesrXdN-3dC2pI22BhtH8c0HXaMPHfa-rnEe-tN2SPZKXVs8-pljslqk98lVkN1eXifzLDCChXEQSSOFDguJOudSgkHwG5xBGea5ictyJiFGjWUk47JgGgQXMuS5AS0hFGNyMrS21lXKmsqheTRt06BxigvOZRh76HSAur59WaN16qld9_5OqwTIKIJp7AWNydlAmb61tsdSdf4z3W8UA7XVrbxutdXt0WBAX6saN_9yKrlLv_kvvP9_6w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3047706910</pqid></control><display><type>article</type><title>Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis</title><source>Wiley</source><creator>Blaschke, Johannes P. ; Brewster, Aaron S. ; Paley, Daniel W. ; Mendez, Derek ; Bhowmick, Asmit ; Sauter, Nicholas K. ; Kröger, Wilko ; Shankar, Murali ; Enders, Bjoern ; Bard, Deborah</creator><creatorcontrib>Blaschke, Johannes P. ; Brewster, Aaron S. ; Paley, Daniel W. ; Mendez, Derek ; Bhowmick, Asmit ; Sauter, Nicholas K. ; Kröger, Wilko ; Shankar, Murali ; Enders, Bjoern ; Bard, Deborah ; SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States) ; Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)</creatorcontrib><description>Summary X‐ray scattering experiments using free electron lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID‐19 viral proteins). 
XFEL experiments are a challenge to computing in two ways: (i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; (ii) data‐collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where it is analyzed using the software package CCTBX. We achieved real time data analysis with a turnaround time from data acquisition to full molecular reconstruction in as little as 10 min—sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS‐NERSC interoperability, we achieved a data analysis rate which matches the data acquisition rate. Completing data analysis within 10 min is a first for XFEL experiments and an important milestone if we are to keep up with data‐collection trends.</description><identifier>ISSN: 1532-0626</identifier><identifier>EISSN: 1532-0634</identifier><identifier>DOI: 10.1002/cpe.8019</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc</publisher><subject>Algorithms ; Coherent light ; Data acquisition ; Data analysis ; Decision analysis ; distributed, parallel, and cluster computing ; Experiments ; Free electron lasers ; INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY ; Light sources ; MATHEMATICS AND COMPUTING ; Molecular structure ; real-time, and urgent high-performance computing ; Workflow</subject><ispartof>Concurrency and computation, 2024-05, Vol.36 (12), p.n/a</ispartof><rights>Published 2024. This article is a U.S. Government work and is in the public domain in the USA. 
published by John Wiley &amp; Sons Ltd.</rights><rights>Published 2024. This article is a U.S. Government work and is in the public domain in the USA.Concurrency and Computation: Practice and Experiencepublished by John Wiley &amp; Sons Ltd. This article is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3159-74c43a5d4eab2440ce04c4e80f5bbc9ff8409eaef749fd1a0323452bc0a4053</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,777,781,882,27905,27906</link.rule.ids><backlink>$$Uhttps://www.osti.gov/servlets/purl/2322459$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Blaschke, Johannes P.</creatorcontrib><creatorcontrib>Brewster, Aaron S.</creatorcontrib><creatorcontrib>Paley, Daniel W.</creatorcontrib><creatorcontrib>Mendez, Derek</creatorcontrib><creatorcontrib>Bhowmick, Asmit</creatorcontrib><creatorcontrib>Sauter, Nicholas K.</creatorcontrib><creatorcontrib>Kröger, Wilko</creatorcontrib><creatorcontrib>Shankar, Murali</creatorcontrib><creatorcontrib>Enders, Bjoern</creatorcontrib><creatorcontrib>Bard, Deborah</creatorcontrib><creatorcontrib>SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States)</creatorcontrib><creatorcontrib>Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)</creatorcontrib><title>Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis</title><title>Concurrency and computation</title><description>Summary X‐ray scattering experiments using free electron lasers (XFELs) are a powerful tool to 
determine the molecular structure and function of unknown samples (such as COVID‐19 viral proteins). XFEL experiments are a challenge to computing in two ways: (i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; (ii) data‐collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where it is analyzed using the software package CCTBX. We achieved real time data analysis with a turnaround time from data acquisition to full molecular reconstruction in as little as 10 min—sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS‐NERSC interoperability, we achieved a data analysis rate which matches the data acquisition rate. 
Completing data analysis within 10 min is a first for XFEL experiments and an important milestone if we are to keep up with data‐collection trends.</description><subject>Algorithms</subject><subject>Coherent light</subject><subject>Data acquisition</subject><subject>Data analysis</subject><subject>Decision analysis</subject><subject>distributed, parallel, and cluster computing</subject><subject>Experiments</subject><subject>Free electron lasers</subject><subject>INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY</subject><subject>Light sources</subject><subject>MATHEMATICS AND COMPUTING</subject><subject>Molecular structure</subject><subject>real-time, and urgent high-performance computing</subject><subject>Workflow</subject><issn>1532-0626</issn><issn>1532-0634</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><recordid>eNp1kM9Kw0AQhxdRsFbBR1j04iV19k-axlsJqQpBpfXgbdlsJhiJScxu0d58BJ_RJ3FrRPDgaX4MHz9mPkKOGUwYAD83HU5mwOIdMmKh4AFMhdz9zXy6Tw6sfQJgDAQbkXqJuv58_3DVM9KHRZrRQjtNdaPrja0s1Y6usnniFwW9SZer5ILOqesrXdN-3dC2pI22BhtH8c0HXaMPHfa-rnEe-tN2SPZKXVs8-pljslqk98lVkN1eXifzLDCChXEQSSOFDguJOudSgkHwG5xBGea5ictyJiFGjWUk47JgGgQXMuS5AS0hFGNyMrS21lXKmsqheTRt06BxigvOZRh76HSAur59WaN16qld9_5OqwTIKIJp7AWNydlAmb61tsdSdf4z3W8UA7XVrbxutdXt0WBAX6saN_9yKrlLv_kvvP9_6w</recordid><startdate>20240530</startdate><enddate>20240530</enddate><creator>Blaschke, Johannes P.</creator><creator>Brewster, Aaron S.</creator><creator>Paley, Daniel W.</creator><creator>Mendez, Derek</creator><creator>Bhowmick, Asmit</creator><creator>Sauter, Nicholas K.</creator><creator>Kröger, Wilko</creator><creator>Shankar, Murali</creator><creator>Enders, Bjoern</creator><creator>Bard, Deborah</creator><general>Wiley Subscription Services, 
Inc</general><general>Wiley</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>OIOZB</scope><scope>OTOTI</scope></search><sort><creationdate>20240530</creationdate><title>Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis</title><author>Blaschke, Johannes P. ; Brewster, Aaron S. ; Paley, Daniel W. ; Mendez, Derek ; Bhowmick, Asmit ; Sauter, Nicholas K. ; Kröger, Wilko ; Shankar, Murali ; Enders, Bjoern ; Bard, Deborah</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3159-74c43a5d4eab2440ce04c4e80f5bbc9ff8409eaef749fd1a0323452bc0a4053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Coherent light</topic><topic>Data acquisition</topic><topic>Data analysis</topic><topic>Decision analysis</topic><topic>distributed, parallel, and cluster computing</topic><topic>Experiments</topic><topic>Free electron lasers</topic><topic>INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY</topic><topic>Light sources</topic><topic>MATHEMATICS AND COMPUTING</topic><topic>Molecular structure</topic><topic>real-time, and urgent high-performance computing</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Blaschke, Johannes P.</creatorcontrib><creatorcontrib>Brewster, Aaron S.</creatorcontrib><creatorcontrib>Paley, Daniel W.</creatorcontrib><creatorcontrib>Mendez, Derek</creatorcontrib><creatorcontrib>Bhowmick, Asmit</creatorcontrib><creatorcontrib>Sauter, Nicholas K.</creatorcontrib><creatorcontrib>Kröger, Wilko</creatorcontrib><creatorcontrib>Shankar, Murali</creatorcontrib><creatorcontrib>Enders, Bjoern</creatorcontrib><creatorcontrib>Bard, 
Deborah</creatorcontrib><creatorcontrib>SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States)</creatorcontrib><creatorcontrib>Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)</creatorcontrib><collection>Wiley Open Access Journals</collection><collection>Wiley Online Library Open Access</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>OSTI.GOV - Hybrid</collection><collection>OSTI.GOV</collection><jtitle>Concurrency and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Blaschke, Johannes P.</au><au>Brewster, Aaron S.</au><au>Paley, Daniel W.</au><au>Mendez, Derek</au><au>Bhowmick, Asmit</au><au>Sauter, Nicholas K.</au><au>Kröger, Wilko</au><au>Shankar, Murali</au><au>Enders, Bjoern</au><au>Bard, Deborah</au><aucorp>SLAC National Accelerator Laboratory (SLAC), Menlo Park, CA (United States)</aucorp><aucorp>Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis</atitle><jtitle>Concurrency and computation</jtitle><date>2024-05-30</date><risdate>2024</risdate><volume>36</volume><issue>12</issue><epage>n/a</epage><issn>1532-0626</issn><eissn>1532-0634</eissn><abstract>Summary X‐ray scattering experiments using free electron lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown 
samples (such as COVID‐19 viral proteins). XFEL experiments are a challenge to computing in two ways: (i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; (ii) data‐collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where it is analyzed using the software package CCTBX. We achieved real time data analysis with a turnaround time from data acquisition to full molecular reconstruction in as little as 10 min—sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS‐NERSC interoperability, we achieved a data analysis rate which matches the data acquisition rate. Completing data analysis within 10 min is a first for XFEL experiments and an important milestone if we are to keep up with data‐collection trends.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/cpe.8019</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1532-0626
ispartof Concurrency and computation, 2024-05, Vol.36 (12), p.n/a
issn 1532-0626
1532-0634
language eng
recordid cdi_osti_scitechconnect_2322459
source Wiley
subjects Algorithms
Coherent light
Data acquisition
Data analysis
Decision analysis
distributed, parallel, and cluster computing
Experiments
Free electron lasers
INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY
Light sources
MATHEMATICS AND COMPUTING
Molecular structure
real-time, and urgent high-performance computing
Workflow
title Real‐time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T22%3A40%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_osti_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Real%E2%80%90time%20XFEL%20data%20analysis%20at%20SLAC%20and%20NERSC:%20A%20trial%20run%20of%20nascent%20exascale%20experimental%20data%20analysis&rft.jtitle=Concurrency%20and%20computation&rft.au=Blaschke,%20Johannes%20P.&rft.aucorp=SLAC%20National%20Accelerator%20Laboratory%20(SLAC),%20Menlo%20Park,%20CA%20(United%20States)&rft.date=2024-05-30&rft.volume=36&rft.issue=12&rft.epage=n/a&rft.issn=1532-0626&rft.eissn=1532-0634&rft_id=info:doi/10.1002/cpe.8019&rft_dat=%3Cproquest_osti_%3E3047706910%3C/proquest_osti_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3159-74c43a5d4eab2440ce04c4e80f5bbc9ff8409eaef749fd1a0323452bc0a4053%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3047706910&rft_id=info:pmid/&rfr_iscdi=true