Loading…

Nonlinearities in protein space limit the utility of informatics in protein biophysics

ABSTRACT We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces,...

Full description

Saved in:
Bibliographic Details
Published in:Proteins, structure, function, and bioinformatics structure, function, and bioinformatics, 2015-11, Vol.83 (11), p.1923-1928
Main Author: Rackovsky, S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:ABSTRACT We examine the utility of informatic‐based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge‐based correlation between the sequences and structures of proteins. It is shown that there are well‐defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common—almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined. Proteins 2015; 83:1923–1928. © 2015 Wiley Periodicals, Inc.
ISSN:0887-3585
1097-0134
DOI:10.1002/prot.24916