Loading…

Evolution of transcription factor DNA binding sites

In bioinformatics, binding of transcription regulatory factors to the cognate binding sites is usually described by sequence-specific binding energy, which is estimated from a training sample of sites. This model implies that all binding sites with binding energy above some threshold are functional...

Full description

Saved in:
Bibliographic Details
Published in:Gene 2005-03, Vol.347 (2), p.255-263
Main Authors: Kotelnikova, Ekaterina A., Makeev, Vsevolod J., Gelfand, Mikhail S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23
cites cdi_FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23
container_end_page 263
container_issue 2
container_start_page 255
container_title Gene
container_volume 347
creator Kotelnikova, Ekaterina A.
Makeev, Vsevolod J.
Gelfand, Mikhail S.
description In bioinformatics, binding of transcription regulatory factors to the cognate binding sites is usually described by sequence-specific binding energy, which is estimated from a training sample of sites. This model implies that all binding sites with binding energy above some threshold are functional and site sequence variations should be considered neutral until they do not reduce this energy below the threshold. To quantify this energy, the binding profile (positional weight matrix, PWM) model or consensus-based model is usually applied. Here we show that in many cases available data are not sufficient to construct a relevant PWM, and modified consensus-based model could be more effective to describe binding properties. Further, using the data about binding sites of several transcription factors, we demonstrate that some non-consensus nucleotides in “orthologous sites” (that is, binding sites of the same factor upstream of orthologous genes), which have been believed to be irrelevant or even hindering the regulation, are evolutionary very stable and specific for the regulated gene. For each two considered genomes, the number of substitutions between non-consensus nucleotides is far less than the expected number of neutral substitutions. Moreover, in several positions of binding sites regulating different genes, there are non-consensus nucleotides conserved in distant genomes. It means that there exists a selection pressure, which results in the stability of non-consensus nucleotides.
doi_str_mv 10.1016/j.gene.2004.12.013
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_67823437</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0378111904007334</els_id><sourcerecordid>17509567</sourcerecordid><originalsourceid>FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23</originalsourceid><addsrcrecordid>eNqFkM9LwzAUgIMobk7_AQ_Sk7fWl6RtUvAy5vwBQy96DmnyOjK2dibtwP_edht4c7k8CF--Fz5CbikkFGj-sEqWWGPCANKEsgQoPyNjKkURA3B5TsbAhYwppcWIXIWwgv5kGbskI5oJlnEJY8Lnu2bdta6po6aKWq_rYLzb7i8qbdrGR0_v06h0tXX1MgquxXBNLiq9DnhznBPy9Tz_nL3Gi4-Xt9l0EZs0hzaWol-T55pLXRXSWLSpNKmwSLNccCxzTQvEMkMuM80ZVrlGDmBRMLDcMD4h9wfv1jffHYZWbVwwuF7rGpsuqFxIxlMuToJDIcYpPwlSkUEx_G5C2AE0vgnBY6W23m20_1EU1BBfrdQQf69WlCnY2--O9q7coP17cqzdA48HAPtqO4deBeOwNmidR9Mq27j__L--WJQ0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>17509567</pqid></control><display><type>article</type><title>Evolution of transcription factor DNA binding sites</title><source>ScienceDirect Freedom Collection</source><creator>Kotelnikova, Ekaterina A. ; Makeev, Vsevolod J. ; Gelfand, Mikhail S.</creator><creatorcontrib>Kotelnikova, Ekaterina A. ; Makeev, Vsevolod J. ; Gelfand, Mikhail S.</creatorcontrib><description>In bioinformatics, binding of transcription regulatory factors to the cognate binding sites is usually described by sequence-specific binding energy, which is estimated from a training sample of sites. This model implies that all binding sites with binding energy above some threshold are functional and site sequence variations should be considered neutral until they do not reduce this energy below the threshold. To quantify this energy, the binding profile (positional weight matrix, PWM) model or consensus-based model is usually applied. Here we show that in many cases available data are not sufficient to construct a relevant PWM, and modified consensus-based model could be more effective to describe binding properties. Further, using the data about binding sites of several transcription factors, we demonstrate that some non-consensus nucleotides in “orthologous sites” (that is, binding sites of the same factor upstream of orthologous genes), which have been believed to be irrelevant or even hindering the regulation, are evolutionary very stable and specific for the regulated gene. For each two considered genomes, the number of substitutions between non-consensus nucleotides is far less than the expected number of neutral substitutions. Moreover, in several positions of binding sites regulating different genes, there are non-consensus nucleotides conserved in distant genomes. It means that there exists a selection pressure, which results in the stability of non-consensus nucleotides.</description><identifier>ISSN: 0378-1119</identifier><identifier>EISSN: 1879-0038</identifier><identifier>DOI: 10.1016/j.gene.2004.12.013</identifier><identifier>PMID: 15725380</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Base Sequence ; Binding site ; Binding Sites ; Consensus ; Consensus Sequence ; DNA - metabolism ; Evolution ; Evolution, Molecular ; Genomics ; Models, Biological ; Prokaryotic Cells - physiology ; Regulation ; Transcription ; Transcription Factors - genetics ; Transcription Factors - metabolism</subject><ispartof>Gene, 2005-03, Vol.347 (2), p.255-263</ispartof><rights>2004 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23</citedby><cites>FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15725380$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kotelnikova, Ekaterina A.</creatorcontrib><creatorcontrib>Makeev, Vsevolod J.</creatorcontrib><creatorcontrib>Gelfand, Mikhail S.</creatorcontrib><title>Evolution of transcription factor DNA binding sites</title><title>Gene</title><addtitle>Gene</addtitle><description>In bioinformatics, binding of transcription regulatory factors to the cognate binding sites is usually described by sequence-specific binding energy, which is estimated from a training sample of sites. This model implies that all binding sites with binding energy above some threshold are functional and site sequence variations should be considered neutral until they do not reduce this energy below the threshold. To quantify this energy, the binding profile (positional weight matrix, PWM) model or consensus-based model is usually applied. Here we show that in many cases available data are not sufficient to construct a relevant PWM, and modified consensus-based model could be more effective to describe binding properties. Further, using the data about binding sites of several transcription factors, we demonstrate that some non-consensus nucleotides in “orthologous sites” (that is, binding sites of the same factor upstream of orthologous genes), which have been believed to be irrelevant or even hindering the regulation, are evolutionary very stable and specific for the regulated gene. For each two considered genomes, the number of substitutions between non-consensus nucleotides is far less than the expected number of neutral substitutions. Moreover, in several positions of binding sites regulating different genes, there are non-consensus nucleotides conserved in distant genomes. It means that there exists a selection pressure, which results in the stability of non-consensus nucleotides.</description><subject>Base Sequence</subject><subject>Binding site</subject><subject>Binding Sites</subject><subject>Consensus</subject><subject>Consensus Sequence</subject><subject>DNA - metabolism</subject><subject>Evolution</subject><subject>Evolution, Molecular</subject><subject>Genomics</subject><subject>Models, Biological</subject><subject>Prokaryotic Cells - physiology</subject><subject>Regulation</subject><subject>Transcription</subject><subject>Transcription Factors - genetics</subject><subject>Transcription Factors - metabolism</subject><issn>0378-1119</issn><issn>1879-0038</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqFkM9LwzAUgIMobk7_AQ_Sk7fWl6RtUvAy5vwBQy96DmnyOjK2dibtwP_edht4c7k8CF--Fz5CbikkFGj-sEqWWGPCANKEsgQoPyNjKkURA3B5TsbAhYwppcWIXIWwgv5kGbskI5oJlnEJY8Lnu2bdta6po6aKWq_rYLzb7i8qbdrGR0_v06h0tXX1MgquxXBNLiq9DnhznBPy9Tz_nL3Gi4-Xt9l0EZs0hzaWol-T55pLXRXSWLSpNKmwSLNccCxzTQvEMkMuM80ZVrlGDmBRMLDcMD4h9wfv1jffHYZWbVwwuF7rGpsuqFxIxlMuToJDIcYpPwlSkUEx_G5C2AE0vgnBY6W23m20_1EU1BBfrdQQf69WlCnY2--O9q7coP17cqzdA48HAPtqO4deBeOwNmidR9Mq27j__L--WJQ0</recordid><startdate>20050314</startdate><enddate>20050314</enddate><creator>Kotelnikova, Ekaterina A.</creator><creator>Makeev, Vsevolod J.</creator><creator>Gelfand, Mikhail S.</creator><general>Elsevier B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope></search><sort><creationdate>20050314</creationdate><title>Evolution of transcription factor DNA binding sites</title><author>Kotelnikova, Ekaterina A. ; Makeev, Vsevolod J. ; Gelfand, Mikhail S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Base Sequence</topic><topic>Binding site</topic><topic>Binding Sites</topic><topic>Consensus</topic><topic>Consensus Sequence</topic><topic>DNA - metabolism</topic><topic>Evolution</topic><topic>Evolution, Molecular</topic><topic>Genomics</topic><topic>Models, Biological</topic><topic>Prokaryotic Cells - physiology</topic><topic>Regulation</topic><topic>Transcription</topic><topic>Transcription Factors - genetics</topic><topic>Transcription Factors - metabolism</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kotelnikova, Ekaterina A.</creatorcontrib><creatorcontrib>Makeev, Vsevolod J.</creatorcontrib><creatorcontrib>Gelfand, Mikhail S.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Gene</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kotelnikova, Ekaterina A.</au><au>Makeev, Vsevolod J.</au><au>Gelfand, Mikhail S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evolution of transcription factor DNA binding sites</atitle><jtitle>Gene</jtitle><addtitle>Gene</addtitle><date>2005-03-14</date><risdate>2005</risdate><volume>347</volume><issue>2</issue><spage>255</spage><epage>263</epage><pages>255-263</pages><issn>0378-1119</issn><eissn>1879-0038</eissn><abstract>In bioinformatics, binding of transcription regulatory factors to the cognate binding sites is usually described by sequence-specific binding energy, which is estimated from a training sample of sites. This model implies that all binding sites with binding energy above some threshold are functional and site sequence variations should be considered neutral until they do not reduce this energy below the threshold. To quantify this energy, the binding profile (positional weight matrix, PWM) model or consensus-based model is usually applied. Here we show that in many cases available data are not sufficient to construct a relevant PWM, and modified consensus-based model could be more effective to describe binding properties. Further, using the data about binding sites of several transcription factors, we demonstrate that some non-consensus nucleotides in “orthologous sites” (that is, binding sites of the same factor upstream of orthologous genes), which have been believed to be irrelevant or even hindering the regulation, are evolutionary very stable and specific for the regulated gene. For each two considered genomes, the number of substitutions between non-consensus nucleotides is far less than the expected number of neutral substitutions. Moreover, in several positions of binding sites regulating different genes, there are non-consensus nucleotides conserved in distant genomes. It means that there exists a selection pressure, which results in the stability of non-consensus nucleotides.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>15725380</pmid><doi>10.1016/j.gene.2004.12.013</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0378-1119
ispartof Gene, 2005-03, Vol.347 (2), p.255-263
issn 0378-1119
1879-0038
language eng
recordid cdi_proquest_miscellaneous_67823437
source ScienceDirect Freedom Collection
subjects Base Sequence
Binding site
Binding Sites
Consensus
Consensus Sequence
DNA - metabolism
Evolution
Evolution, Molecular
Genomics
Models, Biological
Prokaryotic Cells - physiology
Regulation
Transcription
Transcription Factors - genetics
Transcription Factors - metabolism
title Evolution of transcription factor DNA binding sites
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T15%3A27%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evolution%20of%20transcription%20factor%20DNA%20binding%20sites&rft.jtitle=Gene&rft.au=Kotelnikova,%20Ekaterina%20A.&rft.date=2005-03-14&rft.volume=347&rft.issue=2&rft.spage=255&rft.epage=263&rft.pages=255-263&rft.issn=0378-1119&rft.eissn=1879-0038&rft_id=info:doi/10.1016/j.gene.2004.12.013&rft_dat=%3Cproquest_cross%3E17509567%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c460t-8715766a38af98cded48c47de15673eb6a19eeb5e385a32ef6ae300de720d3c23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=17509567&rft_id=info:pmid/15725380&rfr_iscdi=true