Recognizing the pseudogenes in bacterial genomes
Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogene...
Published in: | Nucleic acids research 2005-01, Vol.33 (10), p.3125-3132 |
---|---|
Main Authors: | Lerat, Emmanuelle; Ochman, Howard |
Format: | Article |
Language: | English |
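Subjects: | Genome, Bacterial; Life Sciences; Other; Pseudogenes; Staphylococcus aureus; Staphylococcus aureus - genetics; Streptococcus; Streptococcus pyogenes - genetics; Vibrio - genetics; Vibrio vulnificus; Yersinia - genetics; Yersinia pestis |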
cited_by | cdi_FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3 |
---|---|
cites | |
container_end_page | 3132 |
container_issue | 10 |
container_start_page | 3125 |
container_title | Nucleic acids research |
container_volume | 33 |
creator | Lerat, Emmanuelle; Ochman, Howard |
description | Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes. |
doi_str_mv | 10.1093/nar/gki631 |
format | article |
fulltext | fulltext |
identifier | ISSN: 0305-1048 |
ispartof | Nucleic acids research, 2005-01, Vol.33 (10), p.3125-3132 |
issn | 0305-1048; 1362-4962 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_1142405 |
source | Oxford Journals Open Access Collection; PubMed Central (PMC) |
subjects | Genome, Bacterial; Life Sciences; Other; Pseudogenes; Staphylococcus aureus; Staphylococcus aureus - genetics; Streptococcus; Streptococcus pyogenes - genetics; Vibrio - genetics; Vibrio vulnificus; Yersinia - genetics; Yersinia pestis |
title | Recognizing the pseudogenes in bacterial genomes |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T11%3A19%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recognizing%20the%20pseudogenes%20in%20bacterial%20genomes&rft.jtitle=Nucleic%20acids%20research&rft.au=Lerat,%20Emmanuelle&rft.date=2005-01-01&rft.volume=33&rft.issue=10&rft.spage=3125&rft.epage=3132&rft.pages=3125-3132&rft.issn=0305-1048&rft.eissn=1362-4962&rft.coden=NARHAD&rft_id=info:doi/10.1093/nar/gki631&rft_dat=%3Cproquest_pubme%3E21237075%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c506t-b7d21e02151156a0ed472ff097a995c4dfa65c67d6907e8425384b1e81fd12ff3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=200698788&rft_id=info:pmid/15933207&rfr_iscdi=true |