Loading…

Prediction of function for the polyprenyl transferase subgroup in the isoprenoid synthase superfamily

The number of available protein sequences has increased exponentially with the advent of high-throughput genomic sequencing, creating a significant challenge for functional annotation. Here, we describe a large-scale study on assigning function to unknown members of the trans -polyprenyl transferase...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the National Academy of Sciences - PNAS 2013-03, Vol.110 (13), p.E1196-E1202
Main Authors: Wallrapp, Frank H, Pan, Jian-Jung, Ramamoorthy, Gurusankar, Almonacid, Daniel E, Hillerich, Brandan S, Seidel, Ronald, Patskovsky, Yury, Babbitt, Patricia C, Almo, Steven C, Jacobson, Matthew P, Poulter, C Dale
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The number of available protein sequences has increased exponentially with the advent of high-throughput genomic sequencing, creating a significant challenge for functional annotation. Here, we describe a large-scale study on assigning function to unknown members of the trans -polyprenyl transferase (E-PTS) subgroup in the isoprenoid synthase superfamily, which provides substrates for the biosynthesis of the more than 55,000 isoprenoid metabolites. Although the mechanism for determining the product chain length for these enzymes is known, there is no simple relationship between function and primary sequence, so that assigning function is challenging. We addressed this challenge through large-scale bioinformatics analysis of >5,000 putative polyprenyl transferases; experimental characterization of the chain-length specificity of 79 diverse members of this group; determination of 27 structures of 19 of these enzymes, including seven cocrystallized with substrate analogs or products; and the development and successful application of a computational approach to predict function that leverages available structural data through homology modeling and docking of possible products into the active site. The crystallographic structures and computational structural models of the enzyme–ligand complexes elucidate the structural basis of specificity. As a result of this study, the percentage of E -PTS sequences similar to functionally annotated ones (BLAST e-value ≤ 1e ⁻⁷⁰) increased from 40.6 to 68.8%, and the percentage of sequences similar to available crystal structures increased from 28.9 to 47.4%. The high accuracy of our blind prediction of newly characterized enzymes indicates the potential to predict function to the complete polyprenyl transferase subgroup of the isoprenoid synthase superfamily computationally.
ISSN:0027-8424
1091-6490
DOI:10.1073/pnas.1300632110