
Preserving File Provenance Using Principles of Blockchain to Ensure Scientific Reproducibility

Reproducibility plays an essential role in scientific research to ensure accuracy and serves as a foundation for future advancements. Scientific reproducibility becomes particularly challenging when dealing with vast amounts of input files that change hands or move across different laboratories or o...

Bibliographic Details
Main Authors: Hasan, Rizbanul, Purawat, Shweta, Olschanowsky, Catherine, Altintas, Ilkay
Format: Conference Proceeding
Language: English
cited_by
cites
container_end_page 7
container_issue
container_start_page 1
container_title
container_volume
creator Hasan, Rizbanul
Purawat, Shweta
Olschanowsky, Catherine
Altintas, Ilkay
description Reproducibility plays an essential role in scientific research: it ensures accuracy and serves as a foundation for future advancements. Scientific reproducibility becomes particularly challenging when dealing with vast numbers of input files that change hands or move across different laboratories or organizations. Preserving the provenance of data files ensures that critical information about their originality is captured to support the reproducibility of scientific research. The paper focuses on capturing and verifying the provenance of input and output data files using the principles of blockchain. The technique stores the hashes of data files in a database along with user and workflow information, which allows the workflow to verify the data against the hashes at any point. The method is demonstrated using ParFlow, a hydrologic model, as a proof of concept.
doi_str_mv 10.1109/e-Science58273.2023.10254665
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10254665</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10254665</ieee_id><sourcerecordid>10254665</sourcerecordid><originalsourceid>FETCH-LOGICAL-i119t-994adb0c94c13bb3aed1dc767d89fcc93f08f3344973269d320e3cd5c8a2d0013</originalsourceid><addsrcrecordid>eNo1kE1LAzEURaMgWLT_wEUWbqcmeTOTyVJLq0LBonZryby80afjTEmmhf5769fqwrlwuFwhLrWaaK3cFWVPyNQhFZWxMDHKwEQrU-RlWRyJsbOugkKBMQb0sRgZMEUGVsGpGKf0rtSh0tqWMBIvy0iJ4o67VznnluQy9jvq_MEtV-mbLiN3yJuWkuwbedP2-IFvnjs59HLWpW0k-TNm4IZRPtIm9mGLXHPLw_5cnDS-TTT-yzOxms-ep3fZ4uH2fnq9yFhrN2TO5T7UCl2OGuoaPAUd0JY2VK5BdNCoqgHIc2fBlC6AUQQYCqy8CUppOBMXv14movUm8qeP-_X_JfAFBoFZPw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Preserving File Provenance Using Principles of Blockchain to Ensure Scientific Reproducibility</title><source>IEEE Xplore All Conference Series</source><creator>Hasan, Rizbanul ; Purawat, Shweta ; Olschanowsky, Catherine ; Altintas, Ilkay</creator><creatorcontrib>Hasan, Rizbanul ; Purawat, Shweta ; Olschanowsky, Catherine ; Altintas, Ilkay</creatorcontrib><description>Reproducibility plays an essential role in scientific research to ensure accuracy and serves as a foundation for future advancements. Scientific reproducibility becomes particularly challenging when dealing with vast amounts of input files that change hands or move across different laboratories or organizations. Preserving the provenance of data files ensures critical information about the originality of data files is captured to support the reproducibility of scientific research. The paper focuses on capturing and verifying input and output data file provenance using the principles of blockchain. The technique stores the hashes of data files in a database along with user and workflow information. 
It allows the workflow to verify the data against the hashes at any point. The method is demonstrated using Parflow, a Hydrologic model, as a proof-of-concept.</description><identifier>EISSN: 2325-3703</identifier><identifier>EISBN: 9798350322231</identifier><identifier>DOI: 10.1109/e-Science58273.2023.10254665</identifier><language>eng</language><publisher>IEEE</publisher><subject>blockchain ; Blockchains ; Data integrity ; data provenance ; Laboratories ; metadata ; Organizations ; Reliability ; reproducibility ; Reproducibility of results ; scientific workflow</subject><ispartof>2023 IEEE 19th International Conference on e-Science (e-Science), 2023, p.1-7</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10254665$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10254665$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Hasan, Rizbanul</creatorcontrib><creatorcontrib>Purawat, Shweta</creatorcontrib><creatorcontrib>Olschanowsky, Catherine</creatorcontrib><creatorcontrib>Altintas, Ilkay</creatorcontrib><title>Preserving File Provenance Using Principles of Blockchain to Ensure Scientific Reproducibility</title><title>2023 IEEE 19th International Conference on e-Science (e-Science)</title><addtitle>E-SCIENCE</addtitle><description>Reproducibility plays an essential role in scientific research to ensure accuracy and serves as a foundation for future advancements. Scientific reproducibility becomes particularly challenging when dealing with vast amounts of input files that change hands or move across different laboratories or organizations. 
Preserving the provenance of data files ensures critical information about the originality of data files is captured to support the reproducibility of scientific research. The paper focuses on capturing and verifying input and output data file provenance using the principles of blockchain. The technique stores the hashes of data files in a database along with user and workflow information. It allows the workflow to verify the data against the hashes at any point. The method is demonstrated using Parflow, a Hydrologic model, as a proof-of-concept.</description><subject>blockchain</subject><subject>Blockchains</subject><subject>Data integrity</subject><subject>data provenance</subject><subject>Laboratories</subject><subject>metadata</subject><subject>Organizations</subject><subject>Reliability</subject><subject>reproducibility</subject><subject>Reproducibility of results</subject><subject>scientific workflow</subject><issn>2325-3703</issn><isbn>9798350322231</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNo1kE1LAzEURaMgWLT_wEUWbqcmeTOTyVJLq0LBonZryby80afjTEmmhf5769fqwrlwuFwhLrWaaK3cFWVPyNQhFZWxMDHKwEQrU-RlWRyJsbOugkKBMQb0sRgZMEUGVsGpGKf0rtSh0tqWMBIvy0iJ4o67VznnluQy9jvq_MEtV-mbLiN3yJuWkuwbedP2-IFvnjs59HLWpW0k-TNm4IZRPtIm9mGLXHPLw_5cnDS-TTT-yzOxms-ep3fZ4uH2fnq9yFhrN2TO5T7UCl2OGuoaPAUd0JY2VK5BdNCoqgHIc2fBlC6AUQQYCqy8CUppOBMXv14movUm8qeP-_X_JfAFBoFZPw</recordid><startdate>20231009</startdate><enddate>20231009</enddate><creator>Hasan, Rizbanul</creator><creator>Purawat, Shweta</creator><creator>Olschanowsky, Catherine</creator><creator>Altintas, Ilkay</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20231009</creationdate><title>Preserving File Provenance Using Principles of Blockchain to Ensure Scientific 
Reproducibility</title><author>Hasan, Rizbanul ; Purawat, Shweta ; Olschanowsky, Catherine ; Altintas, Ilkay</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i119t-994adb0c94c13bb3aed1dc767d89fcc93f08f3344973269d320e3cd5c8a2d0013</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>blockchain</topic><topic>Blockchains</topic><topic>Data integrity</topic><topic>data provenance</topic><topic>Laboratories</topic><topic>metadata</topic><topic>Organizations</topic><topic>Reliability</topic><topic>reproducibility</topic><topic>Reproducibility of results</topic><topic>scientific workflow</topic><toplevel>online_resources</toplevel><creatorcontrib>Hasan, Rizbanul</creatorcontrib><creatorcontrib>Purawat, Shweta</creatorcontrib><creatorcontrib>Olschanowsky, Catherine</creatorcontrib><creatorcontrib>Altintas, Ilkay</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library Online</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hasan, Rizbanul</au><au>Purawat, Shweta</au><au>Olschanowsky, Catherine</au><au>Altintas, Ilkay</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Preserving File Provenance Using Principles of Blockchain to Ensure Scientific Reproducibility</atitle><btitle>2023 IEEE 19th International Conference on e-Science 
(e-Science)</btitle><stitle>E-SCIENCE</stitle><date>2023-10-09</date><risdate>2023</risdate><spage>1</spage><epage>7</epage><pages>1-7</pages><eissn>2325-3703</eissn><eisbn>9798350322231</eisbn><abstract>Reproducibility plays an essential role in scientific research to ensure accuracy and serves as a foundation for future advancements. Scientific reproducibility becomes particularly challenging when dealing with vast amounts of input files that change hands or move across different laboratories or organizations. Preserving the provenance of data files ensures critical information about the originality of data files is captured to support the reproducibility of scientific research. The paper focuses on capturing and verifying input and output data file provenance using the principles of blockchain. The technique stores the hashes of data files in a database along with user and workflow information. It allows the workflow to verify the data against the hashes at any point. The method is demonstrated using Parflow, a Hydrologic model, as a proof-of-concept.</abstract><pub>IEEE</pub><doi>10.1109/e-Science58273.2023.10254665</doi><tpages>7</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2325-3703
ispartof 2023 IEEE 19th International Conference on e-Science (e-Science), 2023, p.1-7
issn 2325-3703
language eng
recordid cdi_ieee_primary_10254665
source IEEE Xplore All Conference Series
subjects blockchain
Blockchains
Data integrity
data provenance
Laboratories
metadata
Organizations
Reliability
reproducibility
Reproducibility of results
scientific workflow
title Preserving File Provenance Using Principles of Blockchain to Ensure Scientific Reproducibility
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T16%3A15%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Preserving%20File%20Provenance%20Using%20Principles%20of%20Blockchain%20to%20Ensure%20Scientific%20Reproducibility&rft.btitle=2023%20IEEE%2019th%20International%20Conference%20on%20e-Science%20(e-Science)&rft.au=Hasan,%20Rizbanul&rft.date=2023-10-09&rft.spage=1&rft.epage=7&rft.pages=1-7&rft.eissn=2325-3703&rft_id=info:doi/10.1109/e-Science58273.2023.10254665&rft.eisbn=9798350322231&rft_dat=%3Cieee_CHZPO%3E10254665%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i119t-994adb0c94c13bb3aed1dc767d89fcc93f08f3344973269d320e3cd5c8a2d0013%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10254665&rfr_iscdi=true
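
Illustrative sketch (not part of the catalog record): the abstract describes storing the hashes of data files in a database along with user and workflow information, and verifying the data against those hashes at any point. The Python below is a minimal sketch of that general idea only; it is not the authors' implementation. The choice of SHA-256 and SQLite, the table layout, and every function, column, and parameter name (`sha256_of`, `open_store`, `register_file`, `verify_file`, `recorded_by`, `workflow_id`) are assumptions made for illustration.

```python
import hashlib
import sqlite3
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def open_store(db_path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a hypothetical provenance table."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS provenance ("
        "file_path TEXT PRIMARY KEY, "
        "sha256 TEXT NOT NULL, "
        "recorded_by TEXT NOT NULL, "
        "workflow_id TEXT NOT NULL)"
    )
    return conn


def register_file(conn: sqlite3.Connection, path: Path,
                  recorded_by: str, workflow_id: str) -> None:
    """Store the file's current hash with user and workflow information."""
    conn.execute(
        "INSERT OR REPLACE INTO provenance VALUES (?, ?, ?, ?)",
        (str(path), sha256_of(path), recorded_by, workflow_id),
    )
    conn.commit()


def verify_file(conn: sqlite3.Connection, path: Path) -> bool:
    """Return True only if the file is registered and its hash still matches."""
    row = conn.execute(
        "SELECT sha256 FROM provenance WHERE file_path = ?", (str(path),)
    ).fetchone()
    return row is not None and row[0] == sha256_of(path)
```

In this sketch a workflow would call `register_file` when an input or output file is first produced or received, and `verify_file` before any later step consumes it; a `False` result means the file is unregistered or has changed since registration.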