Loading…
ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses
Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species pos...
Saved in:
Published in: | PeerJ (San Francisco, CA) CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613 |
---|---|
cites | cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613 |
container_end_page | e11376 |
container_issue | |
container_start_page | e11376 |
container_title | PeerJ (San Francisco, CA) |
container_volume | 9 |
creator | Pavlovikj, Natasha Gomes-Neto, Joao Carlos Deogun, Jitender S. Benson, Andrew K. |
description | Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case s |
doi_str_mv | 10.7717/peerj.11376 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9</doaj_id><sourcerecordid>2535107586</sourcerecordid><originalsourceid>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</originalsourceid><addsrcrecordid>eNpdkl9r1jAUh4sobsxd-QUC3giuM3-apvVCGGPqYKAXeh1Ok9M279qmJukr-_bG9x3izM1Jznl4CIdfUbxm9FIppt6viGF3yZhQ9bPilLNalY2Q7fN_7ifFeYw7mk_Da9qIl8WJqKiUVUNPi_234O9v9v4DgYXAlvwMCe0FCbgGbzfjugkv8sySaGCC_CJ9gBl_-XBPeh_I6IaxTGPw2zCuWyIdmITBwURWv24TJOcXMuDiZ2diFsH0EDG-Kl70MEU8f6xnxY9PN9-vv5R3Xz_fXl_dlaaSPJXQ1n3bdYYbCooKjrkgZ1YZZHWrBLe2tbTqOiEUM8JKyYTpmDGMSiNrJs6K26PXetjpNbgZwoP24PSh4cOgISRnJtRMtBw60XbWtJVsaFNZMMLUjFEusW-z6-PRtW7djNbgkgJMT6RPJ4sb9eD3umEVbwXPgrePguB_bhiTnl00OE2woN-i5lJIRpVs6oy--Q_d-S3k5R0oypqKKZGpd0fKBB9jwP7vZxjVf-KhD_HQh3iI3yforoI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2530184173</pqid></control><display><type>article</type><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><source>PubMed Central (Open Access)</source><source>Publicly Available Content Database</source><creator>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</creator><creatorcontrib>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</creatorcontrib><description>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</description><identifier>ISSN: 2167-8359</identifier><identifier>EISSN: 2167-8359</identifier><identifier>DOI: 10.7717/peerj.11376</identifier><identifier>PMID: 34055480</identifier><language>eng</language><publisher>San Diego: PeerJ, Inc</publisher><subject>Antimicrobial resistance ; Automation ; Bacteria ; Bayesian analysis ; Bioinformatics ; Classification ; Computational Biology ; Computer applications ; Data processing ; Epidemiology ; Genes ; Genomes ; Genomics ; Genotypes ; High performance computing ; High-throughput computing ; Microbiology ; Molecular Biology ; Nucleotide sequence ; Pan-genome ; Pathogens ; Plasmids ; Population ; Population structure ; Population-genomics ; Public health ; Quality control ; Salmonella ; Scalability ; Species ; Virulence ; Virulence factors</subject><ispartof>PeerJ (San Francisco, CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376</ispartof><rights>2021 Pavlovikj et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Pavlovikj et al. 2021 Pavlovikj et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</citedby><cites>FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</cites><orcidid>0000-0002-9145-8767</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2530184173/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2530184173?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids></links><search><creatorcontrib>Pavlovikj, Natasha</creatorcontrib><creatorcontrib>Gomes-Neto, Joao Carlos</creatorcontrib><creatorcontrib>Deogun, Jitender S.</creatorcontrib><creatorcontrib>Benson, Andrew K.</creatorcontrib><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><title>PeerJ (San Francisco, CA)</title><description>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</description><subject>Antimicrobial resistance</subject><subject>Automation</subject><subject>Bacteria</subject><subject>Bayesian analysis</subject><subject>Bioinformatics</subject><subject>Classification</subject><subject>Computational Biology</subject><subject>Computer applications</subject><subject>Data processing</subject><subject>Epidemiology</subject><subject>Genes</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Genotypes</subject><subject>High performance computing</subject><subject>High-throughput computing</subject><subject>Microbiology</subject><subject>Molecular Biology</subject><subject>Nucleotide sequence</subject><subject>Pan-genome</subject><subject>Pathogens</subject><subject>Plasmids</subject><subject>Population</subject><subject>Population structure</subject><subject>Population-genomics</subject><subject>Public health</subject><subject>Quality control</subject><subject>Salmonella</subject><subject>Scalability</subject><subject>Species</subject><subject>Virulence</subject><subject>Virulence factors</subject><issn>2167-8359</issn><issn>2167-8359</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpdkl9r1jAUh4sobsxd-QUC3giuM3-apvVCGGPqYKAXeh1Ok9M279qmJukr-_bG9x3izM1Jznl4CIdfUbxm9FIppt6viGF3yZhQ9bPilLNalY2Q7fN_7ifFeYw7mk_Da9qIl8WJqKiUVUNPi_234O9v9v4DgYXAlvwMCe0FCbgGbzfjugkv8sySaGCC_CJ9gBl_-XBPeh_I6IaxTGPw2zCuWyIdmITBwURWv24TJOcXMuDiZ2diFsH0EDG-Kl70MEU8f6xnxY9PN9-vv5R3Xz_fXl_dlaaSPJXQ1n3bdYYbCooKjrkgZ1YZZHWrBLe2tbTqOiEUM8JKyYTpmDGMSiNrJs6K26PXetjpNbgZwoP24PSh4cOgISRnJtRMtBw60XbWtJVsaFNZMMLUjFEusW-z6-PRtW7djNbgkgJMT6RPJ4sb9eD3umEVbwXPgrePguB_bhiTnl00OE2woN-i5lJIRpVs6oy--Q_d-S3k5R0oypqKKZGpd0fKBB9jwP7vZxjVf-KhD_HQh3iI3yforoI</recordid><startdate>20210521</startdate><enddate>20210521</enddate><creator>Pavlovikj, Natasha</creator><creator>Gomes-Neto, Joao Carlos</creator><creator>Deogun, Jitender S.</creator><creator>Benson, Andrew K.</creator><general>PeerJ, Inc</general><general>PeerJ Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-9145-8767</orcidid></search><sort><creationdate>20210521</creationdate><title>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</title><author>Pavlovikj, Natasha ; Gomes-Neto, Joao Carlos ; Deogun, Jitender S. ; Benson, Andrew K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Antimicrobial resistance</topic><topic>Automation</topic><topic>Bacteria</topic><topic>Bayesian analysis</topic><topic>Bioinformatics</topic><topic>Classification</topic><topic>Computational Biology</topic><topic>Computer applications</topic><topic>Data processing</topic><topic>Epidemiology</topic><topic>Genes</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Genotypes</topic><topic>High performance computing</topic><topic>High-throughput computing</topic><topic>Microbiology</topic><topic>Molecular Biology</topic><topic>Nucleotide sequence</topic><topic>Pan-genome</topic><topic>Pathogens</topic><topic>Plasmids</topic><topic>Population</topic><topic>Population structure</topic><topic>Population-genomics</topic><topic>Public health</topic><topic>Quality control</topic><topic>Salmonella</topic><topic>Scalability</topic><topic>Species</topic><topic>Virulence</topic><topic>Virulence factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pavlovikj, Natasha</creatorcontrib><creatorcontrib>Gomes-Neto, Joao Carlos</creatorcontrib><creatorcontrib>Deogun, Jitender S.</creatorcontrib><creatorcontrib>Benson, Andrew K.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>Biological Sciences</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals (Open Access)</collection><jtitle>PeerJ (San Francisco, CA)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pavlovikj, Natasha</au><au>Gomes-Neto, Joao Carlos</au><au>Deogun, Jitender S.</au><au>Benson, Andrew K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses</atitle><jtitle>PeerJ (San Francisco, CA)</jtitle><date>2021-05-21</date><risdate>2021</risdate><volume>9</volume><spage>e11376</spage><epage>e11376</epage><pages>e11376-e11376</pages><artnum>e11376</artnum><issn>2167-8359</issn><eissn>2167-8359</eissn><abstract>Whole Genome Sequence (WGS) data from bacterial species is used for a variety of applications ranging from basic microbiological research, diagnostics, and epidemiological surveillance. The availability of WGS data from hundreds of thousands of individual isolates of individual microbial species poses a tremendous opportunity for discovery and hypothesis-generating research into ecology and evolution of these microorganisms. Flexibility, scalability, and user-friendliness of existing pipelines for population-scale inquiry, however, limit applications of systematic, population-scale approaches. Here, we present ProkEvo, an automated, scalable, reproducible, and open-source framework for bacterial population genomics analyses using WGS data. ProkEvo was specifically developed to achieve the following goals: (1) Automation and scaling of complex combinations of computational analyses for many thousands of bacterial genomes from inputs of raw Illumina paired-end sequence reads; (2) Use of workflow management systems (WMS) such as Pegasus WMS to ensure reproducibility, scalability, modularity, fault-tolerance, and robust file management throughout the process; (3) Use of high-performance and high-throughput computational platforms; (4) Generation of hierarchical-based population structure analysis based on combinations of multi-locus and Bayesian statistical approaches for classification for ecological and epidemiological inquiries; (5) Association of antimicrobial resistance (AMR) genes, putative virulence factors, and plasmids from curated databases with the hierarchically-related genotypic classifications; and (6) Production of pan-genome annotations and data compilation that can be utilized for downstream analysis such as identification of population-specific genomic signatures. The scalability of ProkEvo was measured with two datasets comprising significantly different numbers of input genomes (one with ~2,400 genomes, and the second with ~23,000 genomes). Depending on the dataset and the computational platform used, the running time of ProkEvo varied from ~3-26 days. ProkEvo can be used with virtually any bacterial species, and the Pegasus WMS uniquely facilitates addition or removal of programs from the workflow or modification of options within them. To demonstrate versatility of the ProkEvo platform, we performed a hierarchical-based population structure analyses from available genomes of three distinct pathogenic bacterial species as individual case studies. The specific case studies illustrate how hierarchical analyses of population structures, genotype frequencies, and distribution of specific gene functions can be integrated into an analysis. Collectively, our study shows that ProkEvo presents a practical viable option for scalable, automated analyses of bacterial populations with direct applications for basic microbiology research, clinical microbiological diagnostics, and epidemiological surveillance.</abstract><cop>San Diego</cop><pub>PeerJ, Inc</pub><pmid>34055480</pmid><doi>10.7717/peerj.11376</doi><orcidid>https://orcid.org/0000-0002-9145-8767</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2167-8359 |
ispartof | PeerJ (San Francisco, CA), 2021-05, Vol.9, p.e11376-e11376, Article e11376 |
issn | 2167-8359 2167-8359 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_1392ab39bdc9458084dac3c611025ef9 |
source | PubMed Central (Open Access); Publicly Available Content Database |
subjects | Antimicrobial resistance Automation Bacteria Bayesian analysis Bioinformatics Classification Computational Biology Computer applications Data processing Epidemiology Genes Genomes Genomics Genotypes High performance computing High-throughput computing Microbiology Molecular Biology Nucleotide sequence Pan-genome Pathogens Plasmids Population Population structure Population-genomics Public health Quality control Salmonella Scalability Species Virulence Virulence factors |
title | ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T14%3A22%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ProkEvo:%20an%20automated,%20reproducible,%20and%20scalable%20framework%20for%20high-throughput%20bacterial%20population%20genomics%20analyses&rft.jtitle=PeerJ%20(San%20Francisco,%20CA)&rft.au=Pavlovikj,%20Natasha&rft.date=2021-05-21&rft.volume=9&rft.spage=e11376&rft.epage=e11376&rft.pages=e11376-e11376&rft.artnum=e11376&rft.issn=2167-8359&rft.eissn=2167-8359&rft_id=info:doi/10.7717/peerj.11376&rft_dat=%3Cproquest_doaj_%3E2535107586%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c452t-a96f9bbc2c0a7032e0a7e21d7ce169732dd9d04bb3371c3d5513cb1cc105c5613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2530184173&rft_id=info:pmid/34055480&rfr_iscdi=true |