Loading…
Nonlinearities in protein space limit the utility of informatics in protein biophysics
ABSTRACT We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces,...
Saved in:
Published in: | Proteins, structure, function, and bioinformatics structure, function, and bioinformatics, 2015-11, Vol.83 (11), p.1923-1928 |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83 |
---|---|
cites | cdi_FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83 |
container_end_page | 1928 |
container_issue | 11 |
container_start_page | 1923 |
container_title | Proteins, structure, function, and bioinformatics |
container_volume | 83 |
creator | Rackovsky, S. |
description | ABSTRACT
We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge‐based correlation between the sequences and structures of proteins. It is shown that there are well‐defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common—almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined. Proteins 2015; 83:1923–1928. © 2015 Wiley Periodicals, Inc. |
doi_str_mv | 10.1002/prot.24916 |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4609284</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1722930227</sourcerecordid><originalsourceid>FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83</originalsourceid><addsrcrecordid>eNqN0V1LHDEUBuBQlLpab_oDyoA3Iozm88zkpiB-VVhUZLWXITtmurEzkzHJ2O6_b9Z1F-1F6VXg5DkvyTkIfSb4kGBMj3rv4iHlksAHNCJYFjkmjG-gES7LImeiFFtoO4RHjDFIBh_RFgVGUpmO0P2V6xrbGe1ttCZktssWcSadodeVyRrb2pjFmcmGaBsb55mrk6qdb3W01buOqXX9bB5S9RParHUTzO7ruYPuzs8mJ9_y8fXF5cnxOK9EKSGfMiaE5kQKIHXF-ANooNRIQZikNaaYASHVNBVETQ0HWjMOJQFZQjK6ZDvo6zK3H6ateahMF71uVO9tq_1cOW3V-5vOztQP96w4YElLngL2XwO8expMiKq1oTJNozvjhqBIQQuQIKj8H5oUprRIdO8v-ugG36VJvChOBBOQ1MFSVd6F4E29fjfBarFZtRisetlswl_e_nRNV6tMgCzBL9uY-T-i1M3t9WQVmi97bIjm97pH-58KClYI9f3qQt1MilO4P8fqlv0BwNm9NA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1722415356</pqid></control><display><type>article</type><title>Nonlinearities in protein space limit the utility of informatics in protein biophysics</title><source>Wiley:Jisc Collections:Wiley Read and Publish Open Access 2024-2025 (reading list)</source><creator>Rackovsky, S.</creator><creatorcontrib>Rackovsky, S.</creatorcontrib><description>ABSTRACT
We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge‐based correlation between the sequences and structures of proteins. It is shown that there are well‐defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common—almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined. Proteins 2015; 83:1923–1928. © 2015 Wiley Periodicals, Inc.</description><identifier>ISSN: 0887-3585</identifier><identifier>EISSN: 1097-0134</identifier><identifier>DOI: 10.1002/prot.24916</identifier><identifier>PMID: 26315852</identifier><language>eng</language><publisher>United States: Blackwell Publishing Ltd</publisher><subject>Amino Acid Sequence ; Biophysical Phenomena ; Computational Biology - methods ; conformational switches ; Databases, Protein ; distant homology ; Fourier Analysis ; Protein Conformation ; Proteins - chemistry ; Sequence Homology, Amino Acid ; sequence space ; structure space</subject><ispartof>Proteins, structure, function, and bioinformatics, 2015-11, Vol.83 (11), p.1923-1928</ispartof><rights>2015 Wiley Periodicals, Inc.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83</citedby><cites>FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26315852$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rackovsky, S.</creatorcontrib><title>Nonlinearities in protein space limit the utility of informatics in protein biophysics</title><title>Proteins, structure, function, and bioinformatics</title><addtitle>Proteins</addtitle><description>ABSTRACT
We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge‐based correlation between the sequences and structures of proteins. It is shown that there are well‐defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common—almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined. Proteins 2015; 83:1923–1928. © 2015 Wiley Periodicals, Inc.</description><subject>Amino Acid Sequence</subject><subject>Biophysical Phenomena</subject><subject>Computational Biology - methods</subject><subject>conformational switches</subject><subject>Databases, Protein</subject><subject>distant homology</subject><subject>Fourier Analysis</subject><subject>Protein Conformation</subject><subject>Proteins - chemistry</subject><subject>Sequence Homology, Amino Acid</subject><subject>sequence space</subject><subject>structure space</subject><issn>0887-3585</issn><issn>1097-0134</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqN0V1LHDEUBuBQlLpab_oDyoA3Iozm88zkpiB-VVhUZLWXITtmurEzkzHJ2O6_b9Z1F-1F6VXg5DkvyTkIfSb4kGBMj3rv4iHlksAHNCJYFjkmjG-gES7LImeiFFtoO4RHjDFIBh_RFgVGUpmO0P2V6xrbGe1ttCZktssWcSadodeVyRrb2pjFmcmGaBsb55mrk6qdb3W01buOqXX9bB5S9RParHUTzO7ruYPuzs8mJ9_y8fXF5cnxOK9EKSGfMiaE5kQKIHXF-ANooNRIQZikNaaYASHVNBVETQ0HWjMOJQFZQjK6ZDvo6zK3H6ateahMF71uVO9tq_1cOW3V-5vOztQP96w4YElLngL2XwO8expMiKq1oTJNozvjhqBIQQuQIKj8H5oUprRIdO8v-ugG36VJvChOBBOQ1MFSVd6F4E29fjfBarFZtRisetlswl_e_nRNV6tMgCzBL9uY-T-i1M3t9WQVmi97bIjm97pH-58KClYI9f3qQt1MilO4P8fqlv0BwNm9NA</recordid><startdate>201511</startdate><enddate>201511</enddate><creator>Rackovsky, S.</creator><general>Blackwell Publishing Ltd</general><general>Wiley Subscription Services, Inc</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QL</scope><scope>7QO</scope><scope>7QP</scope><scope>7QR</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>201511</creationdate><title>Nonlinearities in protein space limit the utility of informatics in protein biophysics</title><author>Rackovsky, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Amino Acid Sequence</topic><topic>Biophysical Phenomena</topic><topic>Computational Biology - methods</topic><topic>conformational switches</topic><topic>Databases, Protein</topic><topic>distant homology</topic><topic>Fourier Analysis</topic><topic>Protein Conformation</topic><topic>Proteins - chemistry</topic><topic>Sequence Homology, Amino Acid</topic><topic>sequence space</topic><topic>structure space</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rackovsky, S.</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Proteins, structure, function, and bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rackovsky, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Nonlinearities in protein space limit the utility of informatics in protein biophysics</atitle><jtitle>Proteins, structure, function, and bioinformatics</jtitle><addtitle>Proteins</addtitle><date>2015-11</date><risdate>2015</risdate><volume>83</volume><issue>11</issue><spage>1923</spage><epage>1928</epage><pages>1923-1928</pages><issn>0887-3585</issn><eissn>1097-0134</eissn><abstract>ABSTRACT
We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge‐based correlation between the sequences and structures of proteins. It is shown that there are well‐defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common—almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined. Proteins 2015; 83:1923–1928. © 2015 Wiley Periodicals, Inc.</abstract><cop>United States</cop><pub>Blackwell Publishing Ltd</pub><pmid>26315852</pmid><doi>10.1002/prot.24916</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0887-3585 |
ispartof | Proteins, structure, function, and bioinformatics, 2015-11, Vol.83 (11), p.1923-1928 |
issn | 0887-3585 1097-0134 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4609284 |
source | Wiley:Jisc Collections:Wiley Read and Publish Open Access 2024-2025 (reading list) |
subjects | Amino Acid Sequence Biophysical Phenomena Computational Biology - methods conformational switches Databases, Protein distant homology Fourier Analysis Protein Conformation Proteins - chemistry Sequence Homology, Amino Acid sequence space structure space |
title | Nonlinearities in protein space limit the utility of informatics in protein biophysics |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-24T01%3A32%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Nonlinearities%20in%20protein%20space%20limit%20the%20utility%20of%20informatics%20in%20protein%20biophysics&rft.jtitle=Proteins,%20structure,%20function,%20and%20bioinformatics&rft.au=Rackovsky,%20S.&rft.date=2015-11&rft.volume=83&rft.issue=11&rft.spage=1923&rft.epage=1928&rft.pages=1923-1928&rft.issn=0887-3585&rft.eissn=1097-0134&rft_id=info:doi/10.1002/prot.24916&rft_dat=%3Cproquest_pubme%3E1722930227%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c5896-b3355a419561fc34d6a622e951392f0203611cbe955f2e462f346816986951a83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1722415356&rft_id=info:pmid/26315852&rfr_iscdi=true |