Loading…
Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning
Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of th...
Saved in:
Published in: | Methods (San Diego, Calif.) Calif.), 2016-11, Vol.110, p.97-105 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853 |
---|---|
cites | cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853 |
container_end_page | 105 |
container_issue | |
container_start_page | 97 |
container_title | Methods (San Diego, Calif.) |
container_volume | 110 |
creator | Du, Tianchuan Liao, Li Wu, Cathy H Sun, Bilin |
description | Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features. |
doi_str_mv | 10.1016/j.ymeth.2016.06.001 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1826698710</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1826698710</sourcerecordid><originalsourceid>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</originalsourceid><addsrcrecordid>eNo9UMFKAzEQDaLYWv0CQXL0snWS7KbZoxSrQkEPeg5pdtamdHdrkkX796Z2FR7zZuDNm-ERcs1gyoDJu81032BcT3kappAA7ISMGZRFVjIBp4c-lxkHLkbkIoQNJAWfqXMy4jOuuCjkmOxfPVbORte1tKupx-CqHrOBqe3aaGykjYnefdO683Tnu4iuzQamro3ozdHhy8U1XbiwRk-D7TzSGk3skxs1bUUrxB3dovGtaz8uyVlttgGvBp6Q98XD2_wpW748Ps_vl5kVUMSMYS2NzTEVKcWsxtJAmlZWKFwJ4GByY8oaBCuUssJUCvK8YBxkIQqhCjEht0ff9PBnjyHqxgWL261pseuDZopLWaoZgyQVR6n1XQgea73zrjF-rxnoQ-Z6o38z14fMNSQAS1s3w4F-1WD1v_MXsvgB0fKA_w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1826698710</pqid></control><display><type>article</type><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</creator><creatorcontrib>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</creatorcontrib><description>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</description><identifier>ISSN: 1046-2023</identifier><identifier>EISSN: 1095-9130</identifier><identifier>DOI: 10.1016/j.ymeth.2016.06.001</identifier><identifier>PMID: 27282356</identifier><language>eng</language><publisher>United States</publisher><subject>Amino Acid Sequence - genetics ; Computational Biology - methods ; Machine Learning ; Protein Interaction Mapping - methods ; Protein Interaction Maps - genetics ; Software</subject><ispartof>Methods (San Diego, Calif.), 2016-11, Vol.110, p.97-105</ispartof><rights>Copyright © 2016. Published by Elsevier Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</citedby><cites>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27282356$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Du, Tianchuan</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Wu, Cathy H</creatorcontrib><creatorcontrib>Sun, Bilin</creatorcontrib><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><title>Methods (San Diego, Calif.)</title><addtitle>Methods</addtitle><description>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</description><subject>Amino Acid Sequence - genetics</subject><subject>Computational Biology - methods</subject><subject>Machine Learning</subject><subject>Protein Interaction Mapping - methods</subject><subject>Protein Interaction Maps - genetics</subject><subject>Software</subject><issn>1046-2023</issn><issn>1095-9130</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNo9UMFKAzEQDaLYWv0CQXL0snWS7KbZoxSrQkEPeg5pdtamdHdrkkX796Z2FR7zZuDNm-ERcs1gyoDJu81032BcT3kappAA7ISMGZRFVjIBp4c-lxkHLkbkIoQNJAWfqXMy4jOuuCjkmOxfPVbORte1tKupx-CqHrOBqe3aaGykjYnefdO683Tnu4iuzQamro3ozdHhy8U1XbiwRk-D7TzSGk3skxs1bUUrxB3dovGtaz8uyVlttgGvBp6Q98XD2_wpW748Ps_vl5kVUMSMYS2NzTEVKcWsxtJAmlZWKFwJ4GByY8oaBCuUssJUCvK8YBxkIQqhCjEht0ff9PBnjyHqxgWL261pseuDZopLWaoZgyQVR6n1XQgea73zrjF-rxnoQ-Z6o38z14fMNSQAS1s3w4F-1WD1v_MXsvgB0fKA_w</recordid><startdate>20161101</startdate><enddate>20161101</enddate><creator>Du, Tianchuan</creator><creator>Liao, Li</creator><creator>Wu, Cathy H</creator><creator>Sun, Bilin</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20161101</creationdate><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><author>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Amino Acid Sequence - genetics</topic><topic>Computational Biology - methods</topic><topic>Machine Learning</topic><topic>Protein Interaction Mapping - methods</topic><topic>Protein Interaction Maps - genetics</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Du, Tianchuan</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Wu, Cathy H</creatorcontrib><creatorcontrib>Sun, Bilin</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Methods (San Diego, Calif.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Du, Tianchuan</au><au>Liao, Li</au><au>Wu, Cathy H</au><au>Sun, Bilin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</atitle><jtitle>Methods (San Diego, Calif.)</jtitle><addtitle>Methods</addtitle><date>2016-11-01</date><risdate>2016</risdate><volume>110</volume><spage>97</spage><epage>105</epage><pages>97-105</pages><issn>1046-2023</issn><eissn>1095-9130</eissn><abstract>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</abstract><cop>United States</cop><pmid>27282356</pmid><doi>10.1016/j.ymeth.2016.06.001</doi><tpages>9</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1046-2023 |
ispartof | Methods (San Diego, Calif.), 2016-11, Vol.110, p.97-105 |
issn | 1046-2023 1095-9130 |
language | eng |
recordid | cdi_proquest_miscellaneous_1826698710 |
source | ScienceDirect Freedom Collection 2022-2024 |
subjects | Amino Acid Sequence - genetics Computational Biology - methods Machine Learning Protein Interaction Mapping - methods Protein Interaction Maps - genetics Software |
title | Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T06%3A02%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20residue-residue%20contact%20matrix%20for%20protein-protein%20interaction%20with%20Fisher%20score%20features%20and%20deep%20learning&rft.jtitle=Methods%20(San%20Diego,%20Calif.)&rft.au=Du,%20Tianchuan&rft.date=2016-11-01&rft.volume=110&rft.spage=97&rft.epage=105&rft.pages=97-105&rft.issn=1046-2023&rft.eissn=1095-9130&rft_id=info:doi/10.1016/j.ymeth.2016.06.001&rft_dat=%3Cproquest_cross%3E1826698710%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1826698710&rft_id=info:pmid/27282356&rfr_iscdi=true |