Loading…

ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses

Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species pos...

Full description

Saved in:
Bibliographic Details
Published in:PeerJ (San Francisco, CA) CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376
Main Authors: Pavlovikj, Natasha, Gomes-Neto, Joao Carlos, Deogun, Jitender S., Benson, Andrew K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613
cites cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613
container_end_page e11376
container_issue
container_start_page e11376
container_title PeerJ (San Francisco, CA)
container_volume 9
creator Pavlovikj, Natasha
Gomes-Neto, Joao Carlos
Deogun, Jitender S.
Benson, Andrew K.
description Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case s
doi_str_mv 10.7717/peerj.11376
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9</doaj_id><sourcerecordid>2535107586</sourcerecordid><originalsourceid>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</originalsourceid><addsrcrecordid>eNpdkl9r1jAUh4sobsxd-QUC3giuM3-apvVCGGPqYKAXeh1Ok9M279qmJukr-_bG9x3izM1Jznl4CIdfUbxm9FIppt6viGF3yZhQ9bPilLNalY2Q7fN_7ifFeYw7mk_Da9qIl8WJqKiUVUNPi_234O9v9v4DgYXAlvwMCe0FCbgGbzfjugkv8sySaGCC_CJ9gBl_-XBPeh_I6IaxTGPw2zCuWyIdmITBwURWv24TJOcXMuDiZ2diFsH0EDG-Kl70MEU8f6xnxY9PN9-vv5R3Xz_fXl_dlaaSPJXQ1n3bdYYbCooKjrkgZ1YZZHWrBLe2tbTqOiEUM8JKyYTpmDGMSiNrJs6K26PXetjpNbgZwoP24PSh4cOgISRnJtRMtBw60XbWtJVsaFNZMMLUjFEusW-z6-PRtW7djNbgkgJMT6RPJ4sb9eD3umEVbwXPgrePguB_bhiTnl00OE2woN-i5lJIRpVs6oy--Q_d-S3k5R0oypqKKZGpd0fKBB9jwP7vZxjVf-KhD_HQh3iI3yforoI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2530184173</pqid></control><display><type>article</type><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><source>PubMed Central (Open Access)</source><source>Publicly Available Content Database</source><creator>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</creator><creatorcontrib>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</creatorcontrib><description>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</description><identifier>ISSN: 2167-8359</identifier><identifier>EISSN: 2167-8359</identifier><identifier>DOI: 10.7717/peerj.11376</identifier><identifier>PMID: 34055480</identifier><language>eng</language><publisher>San Diego: PeerJ, Inc</publisher><subject>Antimicrobial resistance ; Automation ; Bacteria ; Bayesian analysis ; Bioinformatics ; Classification ; Computational Biology ; Computer applications ; Data processing ; Epidemiology ; Genes ; Genomes ; Genomics ; Genotypes ; High performance computing ; High-throughput computing ; Microbiology ; Molecular Biology ; Nucleotide sequence ; Pan-genome ; Pathogens ; Plasmids ; Population ; Population structure ; Population-genomics ; Public health ; Quality control ; Salmonella ; Scalability ; Species ; Virulence ; Virulence factors</subject><ispartof>PeerJ (San Francisco, CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376</ispartof><rights>2021 Pavlovikj et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Pavlovikj et al. 2021 Pavlovikj et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</citedby><cites>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</cites><orcidid>0000-0002-9145-8767</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2530184173/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2530184173?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids></links><search><creatorcontrib>Pavlovikj, Natasha</creatorcontrib><creatorcontrib>Gomes-Neto, Joao Carlos</creatorcontrib><creatorcontrib>Deogun, Jitender S.</creatorcontrib><creatorcontrib>Benson, Andrew K.</creatorcontrib><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><title>PeerJ (San Francisco, CA)</title><description>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</description><subject>Antimicrobial resistance</subject><subject>Automation</subject><subject>Bacteria</subject><subject>Bayesian analysis</subject><subject>Bioinformatics</subject><subject>Classification</subject><subject>Computational Biology</subject><subject>Computer applications</subject><subject>Data processing</subject><subject>Epidemiology</subject><subject>Genes</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Genotypes</subject><subject>High performance computing</subject><subject>High-throughput computing</subject><subject>Microbiology</subject><subject>Molecular Biology</subject><subject>Nucleotide sequence</subject><subject>Pan-genome</subject><subject>Pathogens</subject><subject>Plasmids</subject><subject>Population</subject><subject>Population structure</subject><subject>Population-genomics</subject><subject>Public health</subject><subject>Quality control</subject><subject>Salmonella</subject><subject>Scalability</subject><subject>Species</subject><subject>Virulence</subject><subject>Virulence factors</subject><issn>2167-8359</issn><issn>2167-8359</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpdkl9r1jAUh4sobsxd-QUC3giuM3-apvVCGGPqYKAXeh1Ok9M279qmJukr-_bG9x3izM1Jznl4CIdfUbxm9FIppt6viGF3yZhQ9bPilLNalY2Q7fN_7ifFeYw7mk_Da9qIl8WJqKiUVUNPi_234O9v9v4DgYXAlvwMCe0FCbgGbzfjugkv8sySaGCC_CJ9gBl_-XBPeh_I6IaxTGPw2zCuWyIdmITBwURWv24TJOcXMuDiZ2diFsH0EDG-Kl70MEU8f6xnxY9PN9-vv5R3Xz_fXl_dlaaSPJXQ1n3bdYYbCooKjrkgZ1YZZHWrBLe2tbTqOiEUM8JKyYTpmDGMSiNrJs6K26PXetjpNbgZwoP24PSh4cOgISRnJtRMtBw60XbWtJVsaFNZMMLUjFEusW-z6-PRtW7djNbgkgJMT6RPJ4sb9eD3umEVbwXPgrePguB_bhiTnl00OE2woN-i5lJIRpVs6oy--Q_d-S3k5R0oypqKKZGpd0fKBB9jwP7vZxjVf-KhD_HQh3iI3yforoI</recordid><startdate>20210521</startdate><enddate>20210521</enddate><creator>Pavlovikj, Natasha</creator><creator>Gomes-Neto, Joao Carlos</creator><creator>Deogun, Jitender S.</creator><creator>Benson, Andrew K.</creator><general>PeerJ, Inc</general><general>PeerJ Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-9145-8767</orcidid></search><sort><creationdate>20210521</creationdate><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><author>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Antimicrobial resistance</topic><topic>Automation</topic><topic>Bacteria</topic><topic>Bayesian analysis</topic><topic>Bioinformatics</topic><topic>Classification</topic><topic>Computational Biology</topic><topic>Computer applications</topic><topic>Data processing</topic><topic>Epidemiology</topic><topic>Genes</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Genotypes</topic><topic>High performance computing</topic><topic>High-throughput computing</topic><topic>Microbiology</topic><topic>Molecular Biology</topic><topic>Nucleotide sequence</topic><topic>Pan-genome</topic><topic>Pathogens</topic><topic>Plasmids</topic><topic>Population</topic><topic>Population structure</topic><topic>Population-genomics</topic><topic>Public health</topic><topic>Quality control</topic><topic>Salmonella</topic><topic>Scalability</topic><topic>Species</topic><topic>Virulence</topic><topic>Virulence factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pavlovikj, Natasha</creatorcontrib><creatorcontrib>Gomes-Neto, Joao Carlos</creatorcontrib><creatorcontrib>Deogun, Jitender S.</creatorcontrib><creatorcontrib>Benson, Andrew K.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>Biological Sciences</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals (Open Access)</collection><jtitle>PeerJ (San Francisco, CA)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pavlovikj, Natasha</au><au>Gomes-Neto, Joao Carlos</au><au>Deogun, Jitender S.</au><au>Benson, Andrew K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</atitle><jtitle>PeerJ (San Francisco, CA)</jtitle><date>2021-05-21</date><risdate>2021</risdate><volume>9</volume><spage>e11376</spage><epage>e11376</epage><pages>e11376-e11376</pages><artnum>e11376</artnum><issn>2167-8359</issn><eissn>2167-8359</eissn><abstract>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</abstract><cop>San Diego</cop><pub>PeerJ, Inc</pub><pmid>34055480</pmid><doi>10.7717/peerj.11376</doi><orcidid>https://orcid.org/0000-0002-9145-8767</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2167-8359
ispartof PeerJ (San Francisco, CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376
issn 2167-8359
2167-8359
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9
source PubMed Central (Open Access); Publicly Available Content Database
subjects Antimicrobial resistance
Automation
Bacteria
Bayesian analysis
Bioinformatics
Classification
Computational Biology
Computer applications
Data processing
Epidemiology
Genes
Genomes
Genomics
Genotypes
High performance computing
High-throughput computing
Microbiology
Molecular Biology
Nucleotide sequence
Pan-genome
Pathogens
Plasmids
Population
Population structure
Population-genomics
Public health
Quality control
Salmonella
Scalability
Species
Virulence
Virulence factors
title ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T14%3A22%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ProkEvo:%20an%20automated,%20reproducible,%20and%20scalable%20framework%20for%20high-throughput%20bacterial%20population%20genomics%20analyses&rft.jtitle=PeerJ%20(San%20Francisco,%20CA)&rft.au=Pavlovikj,%20Natasha&rft.date=2021-05-21&rft.volume=9&rft.spage=e11376&rft.epage=e11376&rft.pages=e11376-e11376&rft.artnum=e11376&rft.issn=2167-8359&rft.eissn=2167-8359&rft_id=info:doi/10.7717/peerj.11376&rft_dat=%3Cproquest_doaj_%3E2535107586%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2530184173&rft_id=info:pmid/34055480&rfr_iscdi=true