Loading…

Protein complex detection based on partially shared multi-view clustering

Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has...

Full description

Saved in:
Bibliographic Details
Published in:BMC bioinformatics 2016-09, Vol.17 (1), p.371-371, Article 371
Main Authors: Ou-Yang, Le, Zhang, Xiao-Fei, Dai, Dao-Qing, Wu, Meng-Yun, Zhu, Yuan, Liu, Zhiyong, Yan, Hong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3
cites cdi_FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3
container_end_page 371
container_issue 1
container_start_page 371
container_title BMC bioinformatics
container_volume 17
creator Ou-Yang, Le
Zhang, Xiao-Fei
Dai, Dao-Qing
Wu, Meng-Yun
Zhu, Yuan
Liu, Zhiyong
Yan, Hong
description Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC .
doi_str_mv 10.1186/s12859-016-1164-9
format article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5022186</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A464843851</galeid><sourcerecordid>A464843851</sourcerecordid><originalsourceid>FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3</originalsourceid><addsrcrecordid>eNptkk9vFSEUxSdGY2v1A7gxk7ipi6lchmGYjUnTWH1JE41_1oSBO680DDyBqe23Ly-v1j4jLLi5_M4h3Jyqeg3kBEDw9wmo6IaGAG8AOGuGJ9UhsB4aCqR7-qg-qF6kdEUI9IJ0z6sD2nPaCsYOq9XXGDJaX-swbxze1AYz6myDr0eV0NSl2KiYrXLutk6XKpbevLhsm2uLv2vtlpQxWr9-WT2blEv46v48qn6ef_xx9rm5-PJpdXZ60eiOitz0fBxNjxypadEIgYCipeNkOsqnDjjoXjPBSp8JaBUZcerNSGEwyiBRU3tUfdj5bpZxRqPR56ic3EQ7q3grg7Jy_8bbS7kO17IjlJapFYPje4MYfi2Yspxt0uic8hiWJEHAIIay2oK-_Qe9Ckv05XuFomzoKWfkL7VWDqX1Uyjv6q2pPGW8fKYVHRTq5D9U2QZnq4PHyZb-nuDdnqAwGW_yWi0pydX3b_ss7FgdQ0oRp4d5AJHbrMhdVmTJitxmRQ5F8-bxIB8Uf8LR3gFnkLld</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1824972640</pqid></control><display><type>article</type><title>Protein complex detection based on partially shared multi-view clustering</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>Ou-Yang, Le ; Zhang, Xiao-Fei ; Dai, Dao-Qing ; Wu, Meng-Yun ; Zhu, Yuan ; Liu, Zhiyong ; Yan, Hong</creator><creatorcontrib>Ou-Yang, Le ; Zhang, Xiao-Fei ; Dai, Dao-Qing ; Wu, Meng-Yun ; Zhu, Yuan ; Liu, Zhiyong ; Yan, Hong</creatorcontrib><description>Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC .</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-016-1164-9</identifier><identifier>PMID: 27623844</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Cluster Analysis ; Data mining ; Genetic algorithms ; Information Storage and Retrieval ; Physiological aspects ; Protein Interaction Domains and Motifs - immunology ; Protein Interaction Mapping - methods ; Protein-protein interactions</subject><ispartof>BMC bioinformatics, 2016-09, Vol.17 (1), p.371-371, Article 371</ispartof><rights>COPYRIGHT 2016 BioMed Central Ltd.</rights><rights>Copyright BioMed Central 2016</rights><rights>The Author(s) 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3</citedby><cites>FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5022186/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1824972640?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25732,27903,27904,36991,36992,44569,53769,53771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27623844$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ou-Yang, Le</creatorcontrib><creatorcontrib>Zhang, Xiao-Fei</creatorcontrib><creatorcontrib>Dai, Dao-Qing</creatorcontrib><creatorcontrib>Wu, Meng-Yun</creatorcontrib><creatorcontrib>Zhu, Yuan</creatorcontrib><creatorcontrib>Liu, Zhiyong</creatorcontrib><creatorcontrib>Yan, Hong</creatorcontrib><title>Protein complex detection based on partially shared multi-view clustering</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC .</description><subject>Cluster Analysis</subject><subject>Data mining</subject><subject>Genetic algorithms</subject><subject>Information Storage and Retrieval</subject><subject>Physiological aspects</subject><subject>Protein Interaction Domains and Motifs - immunology</subject><subject>Protein Interaction Mapping - methods</subject><subject>Protein-protein interactions</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNptkk9vFSEUxSdGY2v1A7gxk7ipi6lchmGYjUnTWH1JE41_1oSBO680DDyBqe23Ly-v1j4jLLi5_M4h3Jyqeg3kBEDw9wmo6IaGAG8AOGuGJ9UhsB4aCqR7-qg-qF6kdEUI9IJ0z6sD2nPaCsYOq9XXGDJaX-swbxze1AYz6myDr0eV0NSl2KiYrXLutk6XKpbevLhsm2uLv2vtlpQxWr9-WT2blEv46v48qn6ef_xx9rm5-PJpdXZ60eiOitz0fBxNjxypadEIgYCipeNkOsqnDjjoXjPBSp8JaBUZcerNSGEwyiBRU3tUfdj5bpZxRqPR56ic3EQ7q3grg7Jy_8bbS7kO17IjlJapFYPje4MYfi2Yspxt0uic8hiWJEHAIIay2oK-_Qe9Ckv05XuFomzoKWfkL7VWDqX1Uyjv6q2pPGW8fKYVHRTq5D9U2QZnq4PHyZb-nuDdnqAwGW_yWi0pydX3b_ss7FgdQ0oRp4d5AJHbrMhdVmTJitxmRQ5F8-bxIB8Uf8LR3gFnkLld</recordid><startdate>20160913</startdate><enddate>20160913</enddate><creator>Ou-Yang, Le</creator><creator>Zhang, Xiao-Fei</creator><creator>Dai, Dao-Qing</creator><creator>Wu, Meng-Yun</creator><creator>Zhu, Yuan</creator><creator>Liu, Zhiyong</creator><creator>Yan, Hong</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20160913</creationdate><title>Protein complex detection based on partially shared multi-view clustering</title><author>Ou-Yang, Le ; Zhang, Xiao-Fei ; Dai, Dao-Qing ; Wu, Meng-Yun ; Zhu, Yuan ; Liu, Zhiyong ; Yan, Hong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Cluster Analysis</topic><topic>Data mining</topic><topic>Genetic algorithms</topic><topic>Information Storage and Retrieval</topic><topic>Physiological aspects</topic><topic>Protein Interaction Domains and Motifs - immunology</topic><topic>Protein Interaction Mapping - methods</topic><topic>Protein-protein interactions</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ou-Yang, Le</creatorcontrib><creatorcontrib>Zhang, Xiao-Fei</creatorcontrib><creatorcontrib>Dai, Dao-Qing</creatorcontrib><creatorcontrib>Wu, Meng-Yun</creatorcontrib><creatorcontrib>Zhu, Yuan</creatorcontrib><creatorcontrib>Liu, Zhiyong</creatorcontrib><creatorcontrib>Yan, Hong</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ou-Yang, Le</au><au>Zhang, Xiao-Fei</au><au>Dai, Dao-Qing</au><au>Wu, Meng-Yun</au><au>Zhu, Yuan</au><au>Liu, Zhiyong</au><au>Yan, Hong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Protein complex detection based on partially shared multi-view clustering</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2016-09-13</date><risdate>2016</risdate><volume>17</volume><issue>1</issue><spage>371</spage><epage>371</epage><pages>371-371</pages><artnum>371</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC .</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>27623844</pmid><doi>10.1186/s12859-016-1164-9</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2016-09, Vol.17 (1), p.371-371, Article 371
issn 1471-2105
1471-2105
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5022186
source Publicly Available Content Database; PubMed Central
subjects Cluster Analysis
Data mining
Genetic algorithms
Information Storage and Retrieval
Physiological aspects
Protein Interaction Domains and Motifs - immunology
Protein Interaction Mapping - methods
Protein-protein interactions
title Protein complex detection based on partially shared multi-view clustering
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T07%3A20%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Protein%20complex%20detection%20based%20on%20partially%20shared%20multi-view%20clustering&rft.jtitle=BMC%20bioinformatics&rft.au=Ou-Yang,%20Le&rft.date=2016-09-13&rft.volume=17&rft.issue=1&rft.spage=371&rft.epage=371&rft.pages=371-371&rft.artnum=371&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-016-1164-9&rft_dat=%3Cgale_pubme%3EA464843851%3C/gale_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c528t-76bbd7e6e2d3ed88e1e832bfd526f5161c7c4848e14813a0bef7db219dade0af3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1824972640&rft_id=info:pmid/27623844&rft_galeid=A464843851&rfr_iscdi=true