Loading…

Analysis of Domain Structural Class Using an Automated Class Assignment Protocol

The extent to which the contemporary dataset of protein structures can be segregated into four structural “classes” as originally defined by Levitt & Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known...

Full description

Saved in:
Bibliographic Details
Published in:Journal of molecular biology 1996-09, Vol.262 (2), p.168-185
Main Authors: Michie, Alex D., Orengo, Christine A., Thornton, Janet M.
Format: Article
Language:English
Subjects:
Citations: Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c429t-86f0827d2253c9f2cd45dc31a640fb90abd6bdcc281dcff57d83c081778c64283
cites
container_end_page 185
container_issue 2
container_start_page 168
container_title Journal of molecular biology
container_volume 262
creator Michie, Alex D.
Orengo, Christine A.
Thornton, Janet M.
description The extent to which the contemporary dataset of protein structures can be segregated into four structural “classes” as originally defined by Levitt & Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known three-dimensional structures, and for successful assignment it was found that helix/sheet content, contacts between secondary structures and their sequential order had to be used. The procedure attempts to maximise the automatic separation into classes for a dataset of 197 manually classified, non-homologous domains. It was found that approximately 90%of the structures were classified automatically; the remainder were borderline and were left for manual inspection. The method was then applied to a test set of 43 protein domains with similar results. The data support the concept of distinct classes of protein structure, although a few intermediate structures are found, demonstrating that it is possible to define relatively simple parameters complying with commonly accepted nomenclature that automatically define 90% of protein domains with essentially 100% accuracy. However, re-examination of the data also suggested that the previously separate α/β and α+ β classes show considerable overlap and are more naturally represented as a single αβ class. This large αβ class can then be most easily subdivided by consideration of whether the sheets are mainly parallel, antiparallel or mixed. The correlation between structural class and function is discussed, together with the conservation of class within a sequence superfamily. This represents the first step in an automated phenetic description of protein structure complementing the usual phylogenetic approach to protein structure classification.
doi_str_mv 10.1006/jmbi.1996.0506
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_78382126</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022283696905069</els_id><sourcerecordid>78382126</sourcerecordid><originalsourceid>FETCH-LOGICAL-c429t-86f0827d2253c9f2cd45dc31a640fb90abd6bdcc281dcff57d83c081778c64283</originalsourceid><addsrcrecordid>eNp1kD1v2zAQhomiQeo4WbsV1dTNypGUKGo0nCYtYKABEs8ExQ-DgSSmPKpA_n0k2MhWLje8zx3vHkK-UigpgLh9GbpQ0rYVJdQgPpEVBdlupODyM1kBMLZhkosv5ArxBQBqXslLciklp41sVuRxO-r-DQMW0Rd3cdBhLJ5ymkyeku6LXa8RiwOG8VjosdhOeUays-dgixiO4-DGXDymmKOJ_TW58LpHd3Oua3K4__m8-7XZ_3n4vdvuN6ZibZ4X9CBZYxmruWk9M7aqreFUiwp814LurOisMUxSa7yvGyu5AUmbRhpRzSetyY_T3NcU_04OsxoCGtf3enRxQtVILhllYgbLE2hSREzOq9cUBp3eFAW1KFSLQrUoVIvCueHbefLUDc5-4Gdnc_79lHsdlT6mgOrwxIByoDWb37KbPBFuFvAvuKTQBDcaZ0NyJisbw_8-fwcn1olY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>78382126</pqid></control><display><type>article</type><title>Analysis of Domain Structural Class Using an Automated Class Assignment Protocol</title><source>ScienceDirect Journals</source><creator>Michie, Alex D. ; Orengo, Christine A. ; Thornton, Janet M.</creator><creatorcontrib>Michie, Alex D. ; Orengo, Christine A. ; Thornton, Janet M.</creatorcontrib><description>The extent to which the contemporary dataset of protein structures can be segregated into four structural “classes” as originally defined by Levitt &amp; Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known three-dimensional structures, and for successful assignment it was found that helix/sheet content, contacts between secondary structures and their sequential order had to be used. The procedure attempts to maximise the automatic separation into classes for a dataset of 197 manually classified, non-homologous domains. It was found that approximately 90%of the structures were classified automatically; the remainder were borderline and were left for manual inspection. The method was then applied to a test set of 43 protein domains with similar results. The data support the concept of distinct classes of protein structure, although a few intermediate structures are found, demonstrating that it is possible to define relatively simple parameters complying with commonly accepted nomenclature that automatically define 90% of protein domains with essentially 100% accuracy. However, re-examination of the data also suggested that the previously separate α/β and α+ β classes show considerable overlap and are more naturally represented as a single αβ class. This large αβ class can then be most easily subdivided by consideration of whether the sheets are mainly parallel, antiparallel or mixed. The correlation between structural class and function is discussed, together with the conservation of class within a sequence superfamily. This represents the first step in an automated phenetic description of protein structure complementing the usual phylogenetic approach to protein structure classification.</description><identifier>ISSN: 0022-2836</identifier><identifier>EISSN: 1089-8638</identifier><identifier>DOI: 10.1006/jmbi.1996.0506</identifier><identifier>PMID: 8831787</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>algorithm definition ; Algorithms ; automatic assignment ; automation ; Bacterial Toxins - chemistry ; Capsid - chemistry ; chemical structure ; classification ; Ferredoxins - chemistry ; identification ; molecular conformation ; Mosaic Viruses ; Phospholipases A - chemistry ; Protein Conformation ; protein domain ; protein secondary structure ; Protein Structure, Secondary ; proteins ; Ribonuclease H - chemistry ; Shiga Toxin 1 ; Software ; structural class ; Transforming Growth Factor beta - chemistry</subject><ispartof>Journal of molecular biology, 1996-09, Vol.262 (2), p.168-185</ispartof><rights>1996 Academic Press</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c429t-86f0827d2253c9f2cd45dc31a640fb90abd6bdcc281dcff57d83c081778c64283</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27915,27916</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/8831787$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Michie, Alex D.</creatorcontrib><creatorcontrib>Orengo, Christine A.</creatorcontrib><creatorcontrib>Thornton, Janet M.</creatorcontrib><title>Analysis of Domain Structural Class Using an Automated Class Assignment Protocol</title><title>Journal of molecular biology</title><addtitle>J Mol Biol</addtitle><description>The extent to which the contemporary dataset of protein structures can be segregated into four structural “classes” as originally defined by Levitt &amp; Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known three-dimensional structures, and for successful assignment it was found that helix/sheet content, contacts between secondary structures and their sequential order had to be used. The procedure attempts to maximise the automatic separation into classes for a dataset of 197 manually classified, non-homologous domains. It was found that approximately 90%of the structures were classified automatically; the remainder were borderline and were left for manual inspection. The method was then applied to a test set of 43 protein domains with similar results. The data support the concept of distinct classes of protein structure, although a few intermediate structures are found, demonstrating that it is possible to define relatively simple parameters complying with commonly accepted nomenclature that automatically define 90% of protein domains with essentially 100% accuracy. However, re-examination of the data also suggested that the previously separate α/β and α+ β classes show considerable overlap and are more naturally represented as a single αβ class. This large αβ class can then be most easily subdivided by consideration of whether the sheets are mainly parallel, antiparallel or mixed. The correlation between structural class and function is discussed, together with the conservation of class within a sequence superfamily. This represents the first step in an automated phenetic description of protein structure complementing the usual phylogenetic approach to protein structure classification.</description><subject>algorithm definition</subject><subject>Algorithms</subject><subject>automatic assignment</subject><subject>automation</subject><subject>Bacterial Toxins - chemistry</subject><subject>Capsid - chemistry</subject><subject>chemical structure</subject><subject>classification</subject><subject>Ferredoxins - chemistry</subject><subject>identification</subject><subject>molecular conformation</subject><subject>Mosaic Viruses</subject><subject>Phospholipases A - chemistry</subject><subject>Protein Conformation</subject><subject>protein domain</subject><subject>protein secondary structure</subject><subject>Protein Structure, Secondary</subject><subject>proteins</subject><subject>Ribonuclease H - chemistry</subject><subject>Shiga Toxin 1</subject><subject>Software</subject><subject>structural class</subject><subject>Transforming Growth Factor beta - chemistry</subject><issn>0022-2836</issn><issn>1089-8638</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1996</creationdate><recordtype>article</recordtype><recordid>eNp1kD1v2zAQhomiQeo4WbsV1dTNypGUKGo0nCYtYKABEs8ExQ-DgSSmPKpA_n0k2MhWLje8zx3vHkK-UigpgLh9GbpQ0rYVJdQgPpEVBdlupODyM1kBMLZhkosv5ArxBQBqXslLciklp41sVuRxO-r-DQMW0Rd3cdBhLJ5ymkyeku6LXa8RiwOG8VjosdhOeUays-dgixiO4-DGXDymmKOJ_TW58LpHd3Oua3K4__m8-7XZ_3n4vdvuN6ZibZ4X9CBZYxmruWk9M7aqreFUiwp814LurOisMUxSa7yvGyu5AUmbRhpRzSetyY_T3NcU_04OsxoCGtf3enRxQtVILhllYgbLE2hSREzOq9cUBp3eFAW1KFSLQrUoVIvCueHbefLUDc5-4Gdnc_79lHsdlT6mgOrwxIByoDWb37KbPBFuFvAvuKTQBDcaZ0NyJisbw_8-fwcn1olY</recordid><startdate>19960920</startdate><enddate>19960920</enddate><creator>Michie, Alex D.</creator><creator>Orengo, Christine A.</creator><creator>Thornton, Janet M.</creator><general>Elsevier Ltd</general><scope>FBQ</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>19960920</creationdate><title>Analysis of Domain Structural Class Using an Automated Class Assignment Protocol</title><author>Michie, Alex D. ; Orengo, Christine A. ; Thornton, Janet M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c429t-86f0827d2253c9f2cd45dc31a640fb90abd6bdcc281dcff57d83c081778c64283</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1996</creationdate><topic>algorithm definition</topic><topic>Algorithms</topic><topic>automatic assignment</topic><topic>automation</topic><topic>Bacterial Toxins - chemistry</topic><topic>Capsid - chemistry</topic><topic>chemical structure</topic><topic>classification</topic><topic>Ferredoxins - chemistry</topic><topic>identification</topic><topic>molecular conformation</topic><topic>Mosaic Viruses</topic><topic>Phospholipases A - chemistry</topic><topic>Protein Conformation</topic><topic>protein domain</topic><topic>protein secondary structure</topic><topic>Protein Structure, Secondary</topic><topic>proteins</topic><topic>Ribonuclease H - chemistry</topic><topic>Shiga Toxin 1</topic><topic>Software</topic><topic>structural class</topic><topic>Transforming Growth Factor beta - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Michie, Alex D.</creatorcontrib><creatorcontrib>Orengo, Christine A.</creatorcontrib><creatorcontrib>Thornton, Janet M.</creatorcontrib><collection>AGRIS</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of molecular biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Michie, Alex D.</au><au>Orengo, Christine A.</au><au>Thornton, Janet M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Analysis of Domain Structural Class Using an Automated Class Assignment Protocol</atitle><jtitle>Journal of molecular biology</jtitle><addtitle>J Mol Biol</addtitle><date>1996-09-20</date><risdate>1996</risdate><volume>262</volume><issue>2</issue><spage>168</spage><epage>185</epage><pages>168-185</pages><issn>0022-2836</issn><eissn>1089-8638</eissn><abstract>The extent to which the contemporary dataset of protein structures can be segregated into four structural “classes” as originally defined by Levitt &amp; Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known three-dimensional structures, and for successful assignment it was found that helix/sheet content, contacts between secondary structures and their sequential order had to be used. The procedure attempts to maximise the automatic separation into classes for a dataset of 197 manually classified, non-homologous domains. It was found that approximately 90%of the structures were classified automatically; the remainder were borderline and were left for manual inspection. The method was then applied to a test set of 43 protein domains with similar results. The data support the concept of distinct classes of protein structure, although a few intermediate structures are found, demonstrating that it is possible to define relatively simple parameters complying with commonly accepted nomenclature that automatically define 90% of protein domains with essentially 100% accuracy. However, re-examination of the data also suggested that the previously separate α/β and α+ β classes show considerable overlap and are more naturally represented as a single αβ class. This large αβ class can then be most easily subdivided by consideration of whether the sheets are mainly parallel, antiparallel or mixed. The correlation between structural class and function is discussed, together with the conservation of class within a sequence superfamily. This represents the first step in an automated phenetic description of protein structure complementing the usual phylogenetic approach to protein structure classification.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>8831787</pmid><doi>10.1006/jmbi.1996.0506</doi><tpages>18</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0022-2836
ispartof Journal of molecular biology, 1996-09, Vol.262 (2), p.168-185
issn 0022-2836
1089-8638
language eng
recordid cdi_proquest_miscellaneous_78382126
source ScienceDirect Journals
subjects algorithm definition
Algorithms
automatic assignment
automation
Bacterial Toxins - chemistry
Capsid - chemistry
chemical structure
classification
Ferredoxins - chemistry
identification
molecular conformation
Mosaic Viruses
Phospholipases A - chemistry
Protein Conformation
protein domain
protein secondary structure
Protein Structure, Secondary
proteins
Ribonuclease H - chemistry
Shiga Toxin 1
Software
structural class
Transforming Growth Factor beta - chemistry
title Analysis of Domain Structural Class Using an Automated Class Assignment Protocol
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T23%3A13%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Analysis%20of%20Domain%20Structural%20Class%20Using%20an%20Automated%20Class%20Assignment%20Protocol&rft.jtitle=Journal%20of%20molecular%20biology&rft.au=Michie,%20Alex%20D.&rft.date=1996-09-20&rft.volume=262&rft.issue=2&rft.spage=168&rft.epage=185&rft.pages=168-185&rft.issn=0022-2836&rft.eissn=1089-8638&rft_id=info:doi/10.1006/jmbi.1996.0506&rft_dat=%3Cproquest_cross%3E78382126%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c429t-86f0827d2253c9f2cd45dc31a640fb90abd6bdcc281dcff57d83c081778c64283%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=78382126&rft_id=info:pmid/8831787&rfr_iscdi=true