Loading…

AGeS: a software system for microbial genome sequence annotation

The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large num...

Full description

Saved in:
Bibliographic Details
Published in:PloS one 2011-03, Vol.6 (3), p.e17469-e17469
Main Authors: Kumar, Kamal, Desai, Valmik, Cheng, Li, Khitrov, Maxim, Grover, Deepak, Satya, Ravi Vijaya, Yu, Chenggang, Zavaljevski, Nela, Reifman, Jaques
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353
cites cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353
container_end_page e17469
container_issue 3
container_start_page e17469
container_title PloS one
container_volume 6
creator Kumar, Kamal
Desai, Valmik
Cheng, Li
Khitrov, Maxim
Grover, Deepak
Satya, Ravi Vijaya
Yu, Chenggang
Zavaljevski, Nela
Reifman, Jaques
description The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions.
doi_str_mv 10.1371/journal.pone.0017469
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1295462184</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A476903397</galeid><doaj_id>oai_doaj_org_article_56972f0851a341dca351736956e50d08</doaj_id><sourcerecordid>A476903397</sourcerecordid><originalsourceid>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</originalsourceid><addsrcrecordid>eNqNk12L1DAUhoso7jr6D0QLguLFjPlO44U4LLoOLCy46m3ItKczHdpkTFJ1_72p012msqDkouHked8m5yPLnmK0wFTiNzvXe2vaxd5ZWCCEJRPqXnaKFSVzQRC9f7Q_yR6FsEOI00KIh9kJwQwVBMvT7P3yHK7e5iYPro4_jYc8XIcIXV47n3dN6d26MW2-Aeu6dAbfe7Al5MZaF01snH2cPahNG-DJ-J1lXz9--HL2aX5xeb46W17MS0lonGNAjJZEkjVXGCleEkKhqLBUqKAmBRiRpiaCEhCUGbVGmCkqqgpMUawpp7Ps-cF337qgx8cHjUmSCoILlojVgaic2em9bzrjr7Uzjf4TcH6jjY9N2YLmQklSo4JjQxmuSkM5llQoLoCjKt1olr0b_9avO6hKsNGbdmI6PbHNVm_cD00RU1KQZPBqNPAu5SxE3TWhhLY1FlwftEKEckUI-ydZcMkKTvCQghd_kXenYaQ2Jr20sbVLFywHT71kUihEqZKJWtxBpVVBqnpqqbpJ8Yng9USQmAi_4sb0IejV1ef_Zy-_TdmXR-wWTBu3wbX90FthCrIDmHoyBA_1bTUw0sNE3GRDDxOhx4lIsmfHlbwV3YwA_Q3Q_gIQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1295462184</pqid></control><display><type>article</type><title>AGeS: a software system for microbial genome sequence annotation</title><source>PubMed Central(OpenAccess)</source><source>Publicly Available Content (ProQuest)</source><creator>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques</creator><contributor>Aerts, Stein</contributor><creatorcontrib>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques ; Aerts, Stein</creatorcontrib><description>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a &gt;94% overlap in the number of identified genes, a significant number of identical annotated features, and a &gt;90% agreement in enzyme function predictions.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0017469</identifier><identifier>PMID: 21408217</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Acids ; Age ; Annotations ; Armed forces ; Automation ; Bacteria ; Base Sequence ; Bioinformatics ; Biology ; Biotechnology ; Computer programs ; Computer Science ; Data processing ; Gene sequencing ; Genes ; Genes, Bacterial - genetics ; Genome, Bacterial - genetics ; Genomes ; Genomics ; Integrated software ; Linux ; Localization ; Medical research ; Microorganisms ; Molecular Sequence Annotation - methods ; Next-generation sequencing ; Nucleotide sequence ; Ontology ; Proteins ; Reproducibility of Results ; Ribonucleic acid ; RNA ; Software ; Software development tools ; Software upgrading ; Telemedicine ; Web applications</subject><ispartof>PloS one, 2011-03, Vol.6 (3), p.e17469-e17469</ispartof><rights>COPYRIGHT 2011 Public Library of Science</rights><rights>Copyright Public Library of Science Mar 2011</rights><rights>This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. 2011</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</citedby><cites>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1295462184/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1295462184?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/21408217$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Aerts, Stein</contributor><creatorcontrib>Kumar, Kamal</creatorcontrib><creatorcontrib>Desai, Valmik</creatorcontrib><creatorcontrib>Cheng, Li</creatorcontrib><creatorcontrib>Khitrov, Maxim</creatorcontrib><creatorcontrib>Grover, Deepak</creatorcontrib><creatorcontrib>Satya, Ravi Vijaya</creatorcontrib><creatorcontrib>Yu, Chenggang</creatorcontrib><creatorcontrib>Zavaljevski, Nela</creatorcontrib><creatorcontrib>Reifman, Jaques</creatorcontrib><title>AGeS: a software system for microbial genome sequence annotation</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a &gt;94% overlap in the number of identified genes, a significant number of identical annotated features, and a &gt;90% agreement in enzyme function predictions.</description><subject>Acids</subject><subject>Age</subject><subject>Annotations</subject><subject>Armed forces</subject><subject>Automation</subject><subject>Bacteria</subject><subject>Base Sequence</subject><subject>Bioinformatics</subject><subject>Biology</subject><subject>Biotechnology</subject><subject>Computer programs</subject><subject>Computer Science</subject><subject>Data processing</subject><subject>Gene sequencing</subject><subject>Genes</subject><subject>Genes, Bacterial - genetics</subject><subject>Genome, Bacterial - genetics</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Integrated software</subject><subject>Linux</subject><subject>Localization</subject><subject>Medical research</subject><subject>Microorganisms</subject><subject>Molecular Sequence Annotation - methods</subject><subject>Next-generation sequencing</subject><subject>Nucleotide sequence</subject><subject>Ontology</subject><subject>Proteins</subject><subject>Reproducibility of Results</subject><subject>Ribonucleic acid</subject><subject>RNA</subject><subject>Software</subject><subject>Software development tools</subject><subject>Software upgrading</subject><subject>Telemedicine</subject><subject>Web applications</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk12L1DAUhoso7jr6D0QLguLFjPlO44U4LLoOLCy46m3ItKczHdpkTFJ1_72p012msqDkouHked8m5yPLnmK0wFTiNzvXe2vaxd5ZWCCEJRPqXnaKFSVzQRC9f7Q_yR6FsEOI00KIh9kJwQwVBMvT7P3yHK7e5iYPro4_jYc8XIcIXV47n3dN6d26MW2-Aeu6dAbfe7Al5MZaF01snH2cPahNG-DJ-J1lXz9--HL2aX5xeb46W17MS0lonGNAjJZEkjVXGCleEkKhqLBUqKAmBRiRpiaCEhCUGbVGmCkqqgpMUawpp7Ps-cF337qgx8cHjUmSCoILlojVgaic2em9bzrjr7Uzjf4TcH6jjY9N2YLmQklSo4JjQxmuSkM5llQoLoCjKt1olr0b_9avO6hKsNGbdmI6PbHNVm_cD00RU1KQZPBqNPAu5SxE3TWhhLY1FlwftEKEckUI-ydZcMkKTvCQghd_kXenYaQ2Jr20sbVLFywHT71kUihEqZKJWtxBpVVBqnpqqbpJ8Yng9USQmAi_4sb0IejV1ef_Zy-_TdmXR-wWTBu3wbX90FthCrIDmHoyBA_1bTUw0sNE3GRDDxOhx4lIsmfHlbwV3YwA_Q3Q_gIQ</recordid><startdate>20110307</startdate><enddate>20110307</enddate><creator>Kumar, Kamal</creator><creator>Desai, Valmik</creator><creator>Cheng, Li</creator><creator>Khitrov, Maxim</creator><creator>Grover, Deepak</creator><creator>Satya, Ravi Vijaya</creator><creator>Yu, Chenggang</creator><creator>Zavaljevski, Nela</creator><creator>Reifman, Jaques</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>7T7</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20110307</creationdate><title>AGeS: a software system for microbial genome sequence annotation</title><author>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Acids</topic><topic>Age</topic><topic>Annotations</topic><topic>Armed forces</topic><topic>Automation</topic><topic>Bacteria</topic><topic>Base Sequence</topic><topic>Bioinformatics</topic><topic>Biology</topic><topic>Biotechnology</topic><topic>Computer programs</topic><topic>Computer Science</topic><topic>Data processing</topic><topic>Gene sequencing</topic><topic>Genes</topic><topic>Genes, Bacterial - genetics</topic><topic>Genome, Bacterial - genetics</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Integrated software</topic><topic>Linux</topic><topic>Localization</topic><topic>Medical research</topic><topic>Microorganisms</topic><topic>Molecular Sequence Annotation - methods</topic><topic>Next-generation sequencing</topic><topic>Nucleotide sequence</topic><topic>Ontology</topic><topic>Proteins</topic><topic>Reproducibility of Results</topic><topic>Ribonucleic acid</topic><topic>RNA</topic><topic>Software</topic><topic>Software development tools</topic><topic>Software upgrading</topic><topic>Telemedicine</topic><topic>Web applications</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kumar, Kamal</creatorcontrib><creatorcontrib>Desai, Valmik</creatorcontrib><creatorcontrib>Cheng, Li</creatorcontrib><creatorcontrib>Khitrov, Maxim</creatorcontrib><creatorcontrib>Grover, Deepak</creatorcontrib><creatorcontrib>Satya, Ravi Vijaya</creatorcontrib><creatorcontrib>Yu, Chenggang</creatorcontrib><creatorcontrib>Zavaljevski, Nela</creatorcontrib><creatorcontrib>Reifman, Jaques</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale in Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>ProQuest Nursing and Allied Health Journals</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kumar, Kamal</au><au>Desai, Valmik</au><au>Cheng, Li</au><au>Khitrov, Maxim</au><au>Grover, Deepak</au><au>Satya, Ravi Vijaya</au><au>Yu, Chenggang</au><au>Zavaljevski, Nela</au><au>Reifman, Jaques</au><au>Aerts, Stein</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AGeS: a software system for microbial genome sequence annotation</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2011-03-07</date><risdate>2011</risdate><volume>6</volume><issue>3</issue><spage>e17469</spage><epage>e17469</epage><pages>e17469-e17469</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a &gt;94% overlap in the number of identified genes, a significant number of identical annotated features, and a &gt;90% agreement in enzyme function predictions.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>21408217</pmid><doi>10.1371/journal.pone.0017469</doi><tpages>e17469</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2011-03, Vol.6 (3), p.e17469-e17469
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_1295462184
source PubMed Central(OpenAccess); Publicly Available Content (ProQuest)
subjects Acids
Age
Annotations
Armed forces
Automation
Bacteria
Base Sequence
Bioinformatics
Biology
Biotechnology
Computer programs
Computer Science
Data processing
Gene sequencing
Genes
Genes, Bacterial - genetics
Genome, Bacterial - genetics
Genomes
Genomics
Integrated software
Linux
Localization
Medical research
Microorganisms
Molecular Sequence Annotation - methods
Next-generation sequencing
Nucleotide sequence
Ontology
Proteins
Reproducibility of Results
Ribonucleic acid
RNA
Software
Software development tools
Software upgrading
Telemedicine
Web applications
title AGeS: a software system for microbial genome sequence annotation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T22%3A14%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AGeS:%20a%20software%20system%20for%20microbial%20genome%20sequence%20annotation&rft.jtitle=PloS%20one&rft.au=Kumar,%20Kamal&rft.date=2011-03-07&rft.volume=6&rft.issue=3&rft.spage=e17469&rft.epage=e17469&rft.pages=e17469-e17469&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0017469&rft_dat=%3Cgale_plos_%3EA476903397%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1295462184&rft_id=info:pmid/21408217&rft_galeid=A476903397&rfr_iscdi=true