Loading…

Go Get Data (GGD) is a framework that facilitates reproducible access to genomic data

The rapid increase in the amount of genomic data provides researchers with an opportunity to integrate diverse datasets and annotations when addressing a wide range of biological questions. However, genomic datasets are deposited on different platforms and are stored in numerous formats from multipl...

Full description

Saved in:
Bibliographic Details
Published in:Nature communications 2021-04, Vol.12 (1), p.2151-2151, Article 2151
Main Authors: Cormier, Michael J., Belyeu, Jonathan R., Pedersen, Brent S., Brown, Joseph, Köster, Johannes, Quinlan, Aaron R.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The rapid increase in the amount of genomic data provides researchers with an opportunity to integrate diverse datasets and annotations when addressing a wide range of biological questions. However, genomic datasets are deposited on different platforms and are stored in numerous formats from multiple genome builds, which complicates the task of collecting, annotating, transforming, and integrating data as needed. Here, we developed Go Get Data (GGD) as a fast, reproducible approach to installing standardized data recipes. GGD is available on Github ( https://gogetdata.github.io/ ), is extendable to other data types, and can streamline the complexities typically associated with data integration, saving researchers time and improving research reproducibility. Modern biological research is complicated by the difficulty of collecting, transforming, annotating, and integrating datasets. Here, the authors present Go Get Data, a fast, reproducible approach to installing standardized data recipes, with an application to genomics data.
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-021-22381-z