Loading…

Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning

Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of th...

Full description

Saved in:
Bibliographic Details
Published in:Methods (San Diego, Calif.) Calif.), 2016-11, Vol.110, p.97-105
Main Authors: Du, Tianchuan, Liao, Li, Wu, Cathy H, Sun, Bilin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853
cites cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853
container_end_page 105
container_issue
container_start_page 97
container_title Methods (San Diego, Calif.)
container_volume 110
creator Du, Tianchuan
Liao, Li
Wu, Cathy H
Sun, Bilin
description Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.
doi_str_mv 10.1016/j.ymeth.2016.06.001
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1826698710</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1826698710</sourcerecordid><originalsourceid>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</originalsourceid><addsrcrecordid>eNo9UMFKAzEQDaLYWv0CQXL0snWS7KbZoxSrQkEPeg5pdtamdHdrkkX796Z2FR7zZuDNm-ERcs1gyoDJu81032BcT3kappAA7ISMGZRFVjIBp4c-lxkHLkbkIoQNJAWfqXMy4jOuuCjkmOxfPVbORte1tKupx-CqHrOBqe3aaGykjYnefdO683Tnu4iuzQamro3ozdHhy8U1XbiwRk-D7TzSGk3skxs1bUUrxB3dovGtaz8uyVlttgGvBp6Q98XD2_wpW748Ps_vl5kVUMSMYS2NzTEVKcWsxtJAmlZWKFwJ4GByY8oaBCuUssJUCvK8YBxkIQqhCjEht0ff9PBnjyHqxgWL261pseuDZopLWaoZgyQVR6n1XQgea73zrjF-rxnoQ-Z6o38z14fMNSQAS1s3w4F-1WD1v_MXsvgB0fKA_w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1826698710</pqid></control><display><type>article</type><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</creator><creatorcontrib>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</creatorcontrib><description>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</description><identifier>ISSN: 1046-2023</identifier><identifier>EISSN: 1095-9130</identifier><identifier>DOI: 10.1016/j.ymeth.2016.06.001</identifier><identifier>PMID: 27282356</identifier><language>eng</language><publisher>United States</publisher><subject>Amino Acid Sequence - genetics ; Computational Biology - methods ; Machine Learning ; Protein Interaction Mapping - methods ; Protein Interaction Maps - genetics ; Software</subject><ispartof>Methods (San Diego, Calif.), 2016-11, Vol.110, p.97-105</ispartof><rights>Copyright © 2016. Published by Elsevier Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</citedby><cites>FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27282356$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Du, Tianchuan</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Wu, Cathy H</creatorcontrib><creatorcontrib>Sun, Bilin</creatorcontrib><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><title>Methods (San Diego, Calif.)</title><addtitle>Methods</addtitle><description>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</description><subject>Amino Acid Sequence - genetics</subject><subject>Computational Biology - methods</subject><subject>Machine Learning</subject><subject>Protein Interaction Mapping - methods</subject><subject>Protein Interaction Maps - genetics</subject><subject>Software</subject><issn>1046-2023</issn><issn>1095-9130</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNo9UMFKAzEQDaLYWv0CQXL0snWS7KbZoxSrQkEPeg5pdtamdHdrkkX796Z2FR7zZuDNm-ERcs1gyoDJu81032BcT3kappAA7ISMGZRFVjIBp4c-lxkHLkbkIoQNJAWfqXMy4jOuuCjkmOxfPVbORte1tKupx-CqHrOBqe3aaGykjYnefdO683Tnu4iuzQamro3ozdHhy8U1XbiwRk-D7TzSGk3skxs1bUUrxB3dovGtaz8uyVlttgGvBp6Q98XD2_wpW748Ps_vl5kVUMSMYS2NzTEVKcWsxtJAmlZWKFwJ4GByY8oaBCuUssJUCvK8YBxkIQqhCjEht0ff9PBnjyHqxgWL261pseuDZopLWaoZgyQVR6n1XQgea73zrjF-rxnoQ-Z6o38z14fMNSQAS1s3w4F-1WD1v_MXsvgB0fKA_w</recordid><startdate>20161101</startdate><enddate>20161101</enddate><creator>Du, Tianchuan</creator><creator>Liao, Li</creator><creator>Wu, Cathy H</creator><creator>Sun, Bilin</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20161101</creationdate><title>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</title><author>Du, Tianchuan ; Liao, Li ; Wu, Cathy H ; Sun, Bilin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Amino Acid Sequence - genetics</topic><topic>Computational Biology - methods</topic><topic>Machine Learning</topic><topic>Protein Interaction Mapping - methods</topic><topic>Protein Interaction Maps - genetics</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Du, Tianchuan</creatorcontrib><creatorcontrib>Liao, Li</creatorcontrib><creatorcontrib>Wu, Cathy H</creatorcontrib><creatorcontrib>Sun, Bilin</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Methods (San Diego, Calif.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Du, Tianchuan</au><au>Liao, Li</au><au>Wu, Cathy H</au><au>Sun, Bilin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning</atitle><jtitle>Methods (San Diego, Calif.)</jtitle><addtitle>Methods</addtitle><date>2016-11-01</date><risdate>2016</risdate><volume>110</volume><spage>97</spage><epage>105</epage><pages>97-105</pages><issn>1046-2023</issn><eissn>1095-9130</eissn><abstract>Protein-protein interactions play essential roles in many biological processes. Acquiring knowledge of the residue-residue contact information of two interacting proteins is not only helpful in annotating functions for proteins, but also critical for structure-based drug design. The prediction of the protein residue-residue contact matrix of the interfacial regions is challenging. In this work, we introduced deep learning techniques (specifically, stacked autoencoders) to build deep neural network models to tackled the residue-residue contact prediction problem. In tandem with interaction profile Hidden Markov Models, which was used first to extract Fisher score features from protein sequences, stacked autoencoders were deployed to extract and learn hidden abstract features. The deep learning model showed significant improvement over the traditional machine learning model, Support Vector Machines (SVM), with the overall accuracy increased by 15% from 65.40% to 80.82%. We showed that the stacked autoencoders could extract novel features, which can be utilized by deep neural networks and other classifiers to enhance learning, out of the Fisher score features. It is further shown that deep neural networks have significant advantages over SVM in making use of the newly extracted features.</abstract><cop>United States</cop><pmid>27282356</pmid><doi>10.1016/j.ymeth.2016.06.001</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1046-2023
ispartof Methods (San Diego, Calif.), 2016-11, Vol.110, p.97-105
issn 1046-2023
1095-9130
language eng
recordid cdi_proquest_miscellaneous_1826698710
source ScienceDirect Freedom Collection 2022-2024
subjects Amino Acid Sequence - genetics
Computational Biology - methods
Machine Learning
Protein Interaction Mapping - methods
Protein Interaction Maps - genetics
Software
title Prediction of residue-residue contact matrix for protein-protein interaction with Fisher score features and deep learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T06%3A02%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20residue-residue%20contact%20matrix%20for%20protein-protein%20interaction%20with%20Fisher%20score%20features%20and%20deep%20learning&rft.jtitle=Methods%20(San%20Diego,%20Calif.)&rft.au=Du,%20Tianchuan&rft.date=2016-11-01&rft.volume=110&rft.spage=97&rft.epage=105&rft.pages=97-105&rft.issn=1046-2023&rft.eissn=1095-9130&rft_id=info:doi/10.1016/j.ymeth.2016.06.001&rft_dat=%3Cproquest_cross%3E1826698710%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c305t-1ef6ac4e6ac6637fe9a04e6bc38eb3020a4aa9f031588c3ad8044512065353853%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1826698710&rft_id=info:pmid/27282356&rfr_iscdi=true