A study of combined structure/sequence profiles

Bibliographic Details
Published in: Folding & Design 1996-01, Vol. 1 (6), p. 451-461
Main Authors: Elofsson, Arne; Fischer, Daniel; Rice, Danny W.; Le Grand, Scott M.; Eisenberg, David
Format: Article
Language: English
Description
Summary:
Background: For genome sequencing projects to achieve their full impact on biology and medicine, each protein sequence must be identified with its three-dimensional structure. Fold assignment methods (also called profile and threading methods) attempt to assign sequences to known protein folds by computing the compatibility of sequence to fold.
Results: We have extended profile methods for the detection of protein folds having structural similarity but low sequence similarity to sequence probes. Our extension combines sequence substitution tables with structural properties to form a combined profile. The structural properties used in this study include distances between residues, exposed areas, areas buried by polar atoms, and properties of the original three-dimensional profile method. We compared the performance of these combined profiles with different sequence matrices and with the original three-dimensional profile method. To determine the optimal gap penalties and weights used with these profiles, we employed a genetic algorithm. The performance of these combined profiles was tested by cross validation using independent test and training sets.
Conclusions: These studies show that the combined profiles perform better than profiles based on either structural or sequence information alone.
ISSN: 1359-0278, 1878-5808
DOI: 10.1016/S1359-0278(96)00061-2
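
Illustrative note: the summary describes scoring a sequence against a fold with a profile that mixes a sequence substitution term and structural terms, using weights and gap penalties as tunable parameters. The Python sketch below is one way to picture that idea, not the authors' method. It assumes a plain weighted sum of two terms and a linear gap penalty; the abstract does not state the actual functional form. The tables `TOY_SUBSTITUTION` and `TOY_ENVIRONMENT_PREFERENCE`, the function names, and all numeric values are hypothetical and chosen only for illustration.

```python
from typing import Callable, List

# Hypothetical toy tables (NOT taken from the paper): a match/mismatch
# substitution score and a residue-in-environment preference score.
TOY_SUBSTITUTION = {"match": 2.0, "mismatch": -1.0}
TOY_ENVIRONMENT_PREFERENCE = {("L", "buried"): 1.0, ("A", "exposed"): 0.5}


def combined_position_score(probe_res: str, fold_res: str, fold_env: str,
                            w_seq: float, w_struct: float) -> float:
    """Assumed form: weighted sum of a sequence term and a structural term."""
    seq_term = TOY_SUBSTITUTION["match" if probe_res == fold_res else "mismatch"]
    struct_term = TOY_ENVIRONMENT_PREFERENCE.get((probe_res, fold_env), 0.0)
    return w_seq * seq_term + w_struct * struct_term


def global_alignment_score(n_probe: int, n_fold: int,
                           score: Callable[[int, int], float],
                           gap: float) -> float:
    """Needleman-Wunsch global alignment score with a linear gap penalty."""
    # dp[i][j] = best score aligning the first i probe residues
    # with the first j fold positions.
    dp = [[0.0] * (n_fold + 1) for _ in range(n_probe + 1)]
    for i in range(1, n_probe + 1):
        dp[i][0] = -gap * i
    for j in range(1, n_fold + 1):
        dp[0][j] = -gap * j
    for i in range(1, n_probe + 1):
        for j in range(1, n_fold + 1):
            dp[i][j] = max(
                dp[i - 1][j - 1] + score(i - 1, j - 1),  # align residue to position
                dp[i - 1][j] - gap,                      # gap in the fold
                dp[i][j - 1] - gap,                      # gap in the probe
            )
    return dp[n_probe][n_fold]


def fold_compatibility(probe: str, fold_seq: str, fold_envs: List[str],
                       w_seq: float, w_struct: float, gap: float) -> float:
    """Score a probe sequence against one fold described by sequence + environments."""
    def score(i: int, j: int) -> float:
        return combined_position_score(probe[i], fold_seq[j], fold_envs[j],
                                       w_seq, w_struct)
    return global_alignment_score(len(probe), len(fold_seq), score, gap)


if __name__ == "__main__":
    print(fold_compatibility("AL", "AL", ["exposed", "buried"], 1.0, 1.0, 1.0))
```

In this sketch, `w_seq`, `w_struct`, and `gap` stand in for the kind of parameters the summary says were optimized with a genetic algorithm; setting `w_struct` to zero reduces the score to a sequence-only comparison.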