Loading…
AGeS: a software system for microbial genome sequence annotation
The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large num...
Saved in:
Published in: | PloS one 2011-03, Vol.6 (3), p.e17469-e17469 |
---|---|
Main Authors: | , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353 |
---|---|
cites | cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353 |
container_end_page | e17469 |
container_issue | 3 |
container_start_page | e17469 |
container_title | PloS one |
container_volume | 6 |
creator | Kumar, Kamal Desai, Valmik Cheng, Li Khitrov, Maxim Grover, Deepak Satya, Ravi Vijaya Yu, Chenggang Zavaljevski, Nela Reifman, Jaques |
description | The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance.
The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions. |
doi_str_mv | 10.1371/journal.pone.0017469 |
format | article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1295462184</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A476903397</galeid><doaj_id>oai_doaj_org_article_56972f0851a341dca351736956e50d08</doaj_id><sourcerecordid>A476903397</sourcerecordid><originalsourceid>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</originalsourceid><addsrcrecordid>eNqNk12L1DAUhoso7jr6D0QLguLFjPlO44U4LLoOLCy46m3ItKczHdpkTFJ1_72p012msqDkouHked8m5yPLnmK0wFTiNzvXe2vaxd5ZWCCEJRPqXnaKFSVzQRC9f7Q_yR6FsEOI00KIh9kJwQwVBMvT7P3yHK7e5iYPro4_jYc8XIcIXV47n3dN6d26MW2-Aeu6dAbfe7Al5MZaF01snH2cPahNG-DJ-J1lXz9--HL2aX5xeb46W17MS0lonGNAjJZEkjVXGCleEkKhqLBUqKAmBRiRpiaCEhCUGbVGmCkqqgpMUawpp7Ps-cF337qgx8cHjUmSCoILlojVgaic2em9bzrjr7Uzjf4TcH6jjY9N2YLmQklSo4JjQxmuSkM5llQoLoCjKt1olr0b_9avO6hKsNGbdmI6PbHNVm_cD00RU1KQZPBqNPAu5SxE3TWhhLY1FlwftEKEckUI-ydZcMkKTvCQghd_kXenYaQ2Jr20sbVLFywHT71kUihEqZKJWtxBpVVBqnpqqbpJ8Yng9USQmAi_4sb0IejV1ef_Zy-_TdmXR-wWTBu3wbX90FthCrIDmHoyBA_1bTUw0sNE3GRDDxOhx4lIsmfHlbwV3YwA_Q3Q_gIQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1295462184</pqid></control><display><type>article</type><title>AGeS: a software system for microbial genome sequence annotation</title><source>PubMed Central(OpenAccess)</source><source>Publicly Available Content (ProQuest)</source><creator>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques</creator><contributor>Aerts, Stein</contributor><creatorcontrib>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques ; Aerts, Stein</creatorcontrib><description>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance.
The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0017469</identifier><identifier>PMID: 21408217</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Acids ; Age ; Annotations ; Armed forces ; Automation ; Bacteria ; Base Sequence ; Bioinformatics ; Biology ; Biotechnology ; Computer programs ; Computer Science ; Data processing ; Gene sequencing ; Genes ; Genes, Bacterial - genetics ; Genome, Bacterial - genetics ; Genomes ; Genomics ; Integrated software ; Linux ; Localization ; Medical research ; Microorganisms ; Molecular Sequence Annotation - methods ; Next-generation sequencing ; Nucleotide sequence ; Ontology ; Proteins ; Reproducibility of Results ; Ribonucleic acid ; RNA ; Software ; Software development tools ; Software upgrading ; Telemedicine ; Web applications</subject><ispartof>PloS one, 2011-03, Vol.6 (3), p.e17469-e17469</ispartof><rights>COPYRIGHT 2011 Public Library of Science</rights><rights>Copyright Public Library of Science Mar 2011</rights><rights>This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. 2011</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</citedby><cites>FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1295462184/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1295462184?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/21408217$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Aerts, Stein</contributor><creatorcontrib>Kumar, Kamal</creatorcontrib><creatorcontrib>Desai, Valmik</creatorcontrib><creatorcontrib>Cheng, Li</creatorcontrib><creatorcontrib>Khitrov, Maxim</creatorcontrib><creatorcontrib>Grover, Deepak</creatorcontrib><creatorcontrib>Satya, Ravi Vijaya</creatorcontrib><creatorcontrib>Yu, Chenggang</creatorcontrib><creatorcontrib>Zavaljevski, Nela</creatorcontrib><creatorcontrib>Reifman, Jaques</creatorcontrib><title>AGeS: a software system for microbial genome sequence annotation</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance.
The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions.</description><subject>Acids</subject><subject>Age</subject><subject>Annotations</subject><subject>Armed forces</subject><subject>Automation</subject><subject>Bacteria</subject><subject>Base Sequence</subject><subject>Bioinformatics</subject><subject>Biology</subject><subject>Biotechnology</subject><subject>Computer programs</subject><subject>Computer Science</subject><subject>Data processing</subject><subject>Gene sequencing</subject><subject>Genes</subject><subject>Genes, Bacterial - genetics</subject><subject>Genome, Bacterial - genetics</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Integrated software</subject><subject>Linux</subject><subject>Localization</subject><subject>Medical research</subject><subject>Microorganisms</subject><subject>Molecular Sequence Annotation - methods</subject><subject>Next-generation sequencing</subject><subject>Nucleotide sequence</subject><subject>Ontology</subject><subject>Proteins</subject><subject>Reproducibility of Results</subject><subject>Ribonucleic acid</subject><subject>RNA</subject><subject>Software</subject><subject>Software development tools</subject><subject>Software upgrading</subject><subject>Telemedicine</subject><subject>Web applications</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk12L1DAUhoso7jr6D0QLguLFjPlO44U4LLoOLCy46m3ItKczHdpkTFJ1_72p012msqDkouHked8m5yPLnmK0wFTiNzvXe2vaxd5ZWCCEJRPqXnaKFSVzQRC9f7Q_yR6FsEOI00KIh9kJwQwVBMvT7P3yHK7e5iYPro4_jYc8XIcIXV47n3dN6d26MW2-Aeu6dAbfe7Al5MZaF01snH2cPahNG-DJ-J1lXz9--HL2aX5xeb46W17MS0lonGNAjJZEkjVXGCleEkKhqLBUqKAmBRiRpiaCEhCUGbVGmCkqqgpMUawpp7Ps-cF337qgx8cHjUmSCoILlojVgaic2em9bzrjr7Uzjf4TcH6jjY9N2YLmQklSo4JjQxmuSkM5llQoLoCjKt1olr0b_9avO6hKsNGbdmI6PbHNVm_cD00RU1KQZPBqNPAu5SxE3TWhhLY1FlwftEKEckUI-ydZcMkKTvCQghd_kXenYaQ2Jr20sbVLFywHT71kUihEqZKJWtxBpVVBqnpqqbpJ8Yng9USQmAi_4sb0IejV1ef_Zy-_TdmXR-wWTBu3wbX90FthCrIDmHoyBA_1bTUw0sNE3GRDDxOhx4lIsmfHlbwV3YwA_Q3Q_gIQ</recordid><startdate>20110307</startdate><enddate>20110307</enddate><creator>Kumar, Kamal</creator><creator>Desai, Valmik</creator><creator>Cheng, Li</creator><creator>Khitrov, Maxim</creator><creator>Grover, Deepak</creator><creator>Satya, Ravi Vijaya</creator><creator>Yu, Chenggang</creator><creator>Zavaljevski, Nela</creator><creator>Reifman, Jaques</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>7T7</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20110307</creationdate><title>AGeS: a software system for microbial genome sequence annotation</title><author>Kumar, Kamal ; Desai, Valmik ; Cheng, Li ; Khitrov, Maxim ; Grover, Deepak ; Satya, Ravi Vijaya ; Yu, Chenggang ; Zavaljevski, Nela ; Reifman, Jaques</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Acids</topic><topic>Age</topic><topic>Annotations</topic><topic>Armed forces</topic><topic>Automation</topic><topic>Bacteria</topic><topic>Base Sequence</topic><topic>Bioinformatics</topic><topic>Biology</topic><topic>Biotechnology</topic><topic>Computer programs</topic><topic>Computer Science</topic><topic>Data processing</topic><topic>Gene sequencing</topic><topic>Genes</topic><topic>Genes, Bacterial - genetics</topic><topic>Genome, Bacterial - genetics</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Integrated software</topic><topic>Linux</topic><topic>Localization</topic><topic>Medical research</topic><topic>Microorganisms</topic><topic>Molecular Sequence Annotation - methods</topic><topic>Next-generation sequencing</topic><topic>Nucleotide sequence</topic><topic>Ontology</topic><topic>Proteins</topic><topic>Reproducibility of Results</topic><topic>Ribonucleic acid</topic><topic>RNA</topic><topic>Software</topic><topic>Software development tools</topic><topic>Software upgrading</topic><topic>Telemedicine</topic><topic>Web applications</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kumar, Kamal</creatorcontrib><creatorcontrib>Desai, Valmik</creatorcontrib><creatorcontrib>Cheng, Li</creatorcontrib><creatorcontrib>Khitrov, Maxim</creatorcontrib><creatorcontrib>Grover, Deepak</creatorcontrib><creatorcontrib>Satya, Ravi Vijaya</creatorcontrib><creatorcontrib>Yu, Chenggang</creatorcontrib><creatorcontrib>Zavaljevski, Nela</creatorcontrib><creatorcontrib>Reifman, Jaques</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale in Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>ProQuest Nursing and Allied Health Journals</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kumar, Kamal</au><au>Desai, Valmik</au><au>Cheng, Li</au><au>Khitrov, Maxim</au><au>Grover, Deepak</au><au>Satya, Ravi Vijaya</au><au>Yu, Chenggang</au><au>Zavaljevski, Nela</au><au>Reifman, Jaques</au><au>Aerts, Stein</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AGeS: a software system for microbial genome sequence annotation</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2011-03-07</date><risdate>2011</risdate><volume>6</volume><issue>3</issue><spage>e17469</spage><epage>e17469</epage><pages>e17469-e17469</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance.
The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>21408217</pmid><doi>10.1371/journal.pone.0017469</doi><tpages>e17469</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2011-03, Vol.6 (3), p.e17469-e17469 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_1295462184 |
source | PubMed Central(OpenAccess); Publicly Available Content (ProQuest) |
subjects | Acids Age Annotations Armed forces Automation Bacteria Base Sequence Bioinformatics Biology Biotechnology Computer programs Computer Science Data processing Gene sequencing Genes Genes, Bacterial - genetics Genome, Bacterial - genetics Genomes Genomics Integrated software Linux Localization Medical research Microorganisms Molecular Sequence Annotation - methods Next-generation sequencing Nucleotide sequence Ontology Proteins Reproducibility of Results Ribonucleic acid RNA Software Software development tools Software upgrading Telemedicine Web applications |
title | AGeS: a software system for microbial genome sequence annotation |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T22%3A14%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AGeS:%20a%20software%20system%20for%20microbial%20genome%20sequence%20annotation&rft.jtitle=PloS%20one&rft.au=Kumar,%20Kamal&rft.date=2011-03-07&rft.volume=6&rft.issue=3&rft.spage=e17469&rft.epage=e17469&rft.pages=e17469-e17469&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0017469&rft_dat=%3Cgale_plos_%3EA476903397%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c723t-1e043c272b591095c223e8d179083a095427af2632e634a9b014936ddea88b353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1295462184&rft_id=info:pmid/21408217&rft_galeid=A476903397&rfr_iscdi=true |