Recognizing the pseudogenes in bacterial genomes

Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogene...

Bibliographic Details
Published in: Nucleic acids research 2005-01, Vol.33 (10), p.3125-3132
Main Authors: Lerat, Emmanuelle; Ochman, Howard
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3
cites
container_end_page 3132
container_issue 10
container_start_page 3125
container_title Nucleic acids research
container_volume 33
creator Lerat, Emmanuelle
Ochman, Howard
description Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.
doi_str_mv 10.1093/nar/gki631
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_1142405</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>21237075</sourcerecordid><originalsourceid>FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3</originalsourceid><addsrcrecordid>eNqFkd9r1EAQxxdR7Fl98Q-Q4IOgEDuzP5OXQinqiQdCqUV8WfaSSW7bXHLuJkX9690jR9W--DQw85nvfJkvY88R3iKU4qR34aS98VrgA7ZAoXkuS80fsgUIUDmCLI7YkxivAVCiko_ZEapSCA5mweCCqqHt_S_ft9m4oWwXaaqHlnqKme-ztatGCt51WWoNW4pP2aPGdZGeHeox-_L-3eX5Ml99_vDx_GyVVwr0mK9NzZGAo0JU2gHV0vCmgdK4slSVrBunVaVNrUswVEiuRCHXSAU2NSZQHLPTWXc3rbdUV9SPwXV2F_zWhZ92cN7-O-n9xrbDrUWUXIJKAq9ngc29teXZyu57AJIbY_QtJvbV4VgYvk8UR7v1saKucz0NU7TalIAF5_8FOXJhwOyvv7wHXg9T6NPHLAfQZWGKIkFvZqgKQ4yBmjufCHYfrU3R2jnaBL_4-yF_0EOWCchnwMeRftzNXbhJ_oVRdvn1m70yVwDm06W9EL8BL7ytbw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>200698788</pqid></control><display><type>article</type><title>Recognizing the pseudogenes in bacterial genomes</title><source>Oxford Journals Open Access Collection</source><source>PubMed Central (PMC)</source><creator>Lerat, Emmanuelle ; Ochman, Howard</creator><creatorcontrib>Lerat, Emmanuelle ; Ochman, Howard</creatorcontrib><description>Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). 
Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.</description><identifier>ISSN: 0305-1048</identifier><identifier>EISSN: 1362-4962</identifier><identifier>DOI: 10.1093/nar/gki631</identifier><identifier>PMID: 15933207</identifier><identifier>CODEN: NARHAD</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Genome, Bacterial ; Life Sciences ; Other ; Pseudogenes ; Staphylococcus aureus ; Staphylococcus aureus - genetics ; Streptococcus ; Streptococcus pyogenes - genetics ; Vibrio - genetics ; Vibrio vulnificus ; Yersinia - genetics ; Yersinia pestis</subject><ispartof>Nucleic acids research, 2005-01, Vol.33 (10), p.3125-3132</ispartof><rights>Copyright Oxford University Press(England) 2005</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><rights>The Author 2005. Published by Oxford University Press. 
All rights reserved 2005</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3</citedby><orcidid>0000-0001-6757-8796</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC1142405/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC1142405/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27903,27904,53769,53771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15933207$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-00427776$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Lerat, Emmanuelle</creatorcontrib><creatorcontrib>Ochman, Howard</creatorcontrib><title>Recognizing the pseudogenes in bacterial genomes</title><title>Nucleic acids research</title><addtitle>Nucl. Acids Res</addtitle><description>Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). 
Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.</description><subject>Genome, Bacterial</subject><subject>Life Sciences</subject><subject>Other</subject><subject>Pseudogenes</subject><subject>Staphylococcus aureus</subject><subject>Staphylococcus aureus - genetics</subject><subject>Streptococcus</subject><subject>Streptococcus pyogenes - genetics</subject><subject>Vibrio - genetics</subject><subject>Vibrio vulnificus</subject><subject>Yersinia - genetics</subject><subject>Yersinia 
pestis</subject><issn>0305-1048</issn><issn>1362-4962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqFkd9r1EAQxxdR7Fl98Q-Q4IOgEDuzP5OXQinqiQdCqUV8WfaSSW7bXHLuJkX9690jR9W--DQw85nvfJkvY88R3iKU4qR34aS98VrgA7ZAoXkuS80fsgUIUDmCLI7YkxivAVCiko_ZEapSCA5mweCCqqHt_S_ft9m4oWwXaaqHlnqKme-ztatGCt51WWoNW4pP2aPGdZGeHeox-_L-3eX5Ml99_vDx_GyVVwr0mK9NzZGAo0JU2gHV0vCmgdK4slSVrBunVaVNrUswVEiuRCHXSAU2NSZQHLPTWXc3rbdUV9SPwXV2F_zWhZ92cN7-O-n9xrbDrUWUXIJKAq9ngc29teXZyu57AJIbY_QtJvbV4VgYvk8UR7v1saKucz0NU7TalIAF5_8FOXJhwOyvv7wHXg9T6NPHLAfQZWGKIkFvZqgKQ4yBmjufCHYfrU3R2jnaBL_4-yF_0EOWCchnwMeRftzNXbhJ_oVRdvn1m70yVwDm06W9EL8BL7ytbw</recordid><startdate>20050101</startdate><enddate>20050101</enddate><creator>Lerat, Emmanuelle</creator><creator>Ochman, Howard</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QL</scope><scope>7QO</scope><scope>7QP</scope><scope>7QR</scope><scope>7SS</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>1XC</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-6757-8796</orcidid></search><sort><creationdate>20050101</creationdate><title>Recognizing the pseudogenes in bacterial genomes</title><author>Lerat, Emmanuelle ; Ochman, Howard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Genome, Bacterial</topic><topic>Life 
Sciences</topic><topic>Other</topic><topic>Pseudogenes</topic><topic>Staphylococcus aureus</topic><topic>Staphylococcus aureus - genetics</topic><topic>Streptococcus</topic><topic>Streptococcus pyogenes - genetics</topic><topic>Vibrio - genetics</topic><topic>Vibrio vulnificus</topic><topic>Yersinia - genetics</topic><topic>Yersinia pestis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lerat, Emmanuelle</creatorcontrib><creatorcontrib>Ochman, Howard</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Nucleic acids research</jtitle></facets><delivery><delcategory>Remote Search 
Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lerat, Emmanuelle</au><au>Ochman, Howard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recognizing the pseudogenes in bacterial genomes</atitle><jtitle>Nucleic acids research</jtitle><addtitle>Nucl. Acids Res</addtitle><date>2005-01-01</date><risdate>2005</risdate><volume>33</volume><issue>10</issue><spage>3125</spage><epage>3132</epage><pages>3125-3132</pages><issn>0305-1048</issn><eissn>1362-4962</eissn><coden>NARHAD</coden><abstract>Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. 
Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>15933207</pmid><doi>10.1093/nar/gki631</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0001-6757-8796</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0305-1048
ispartof Nucleic acids research, 2005-01, Vol.33 (10), p.3125-3132
issn 0305-1048
1362-4962
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_1142405
source Oxford Journals Open Access Collection; PubMed Central (PMC)
subjects Genome, Bacterial
Life Sciences
Other
Pseudogenes
Staphylococcus aureus
Staphylococcus aureus - genetics
Streptococcus
Streptococcus pyogenes - genetics
Vibrio - genetics
Vibrio vulnificus
Yersinia - genetics
Yersinia pestis
title Recognizing the pseudogenes in bacterial genomes
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T11%3A19%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recognizing%20the%20pseudogenes%20in%20bacterial%20genomes&rft.jtitle=Nucleic%20acids%20research&rft.au=Lerat,%20Emmanuelle&rft.date=2005-01-01&rft.volume=33&rft.issue=10&rft.spage=3125&rft.epage=3132&rft.pages=3125-3132&rft.issn=0305-1048&rft.eissn=1362-4962&rft.coden=NARHAD&rft_id=info:doi/10.1093/nar/gki631&rft_dat=%3Cproquest_pubme%3E21237075%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=200698788&rft_id=info:pmid/15933207&rfr_iscdi=true