Loading…

Increasing confidence of protein interactomes using network topological metrics

Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-thr...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics 2006-08, Vol.22 (16), p.1998-2004
Main Authors: Chen, Jin, Hsu, Wynne, Lee, Mong Li, Ng, See-Kiong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73
cites cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73
container_end_page 2004
container_issue 16
container_start_page 1998
container_title Bioinformatics
container_volume 22
creator Chen, Jin
Hsu, Wynne
Lee, Mong Li
Ng, See-Kiong
description Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btl335
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_68732397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1107233321</sourcerecordid><originalsourceid>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</originalsourceid><addsrcrecordid>eNqF0U1LHTEUBuAgFrW2P0EZhHY3NZl8L63UKlzq5hZKNyHJJBKdSa5Jhrb_3th7qbSbrhLIcw455wXgBMEPCEp8bkIK0ac86xpsOTd1wpjugSNEGOwHSOV-u2PGeyIgPgSvS7mHkCJCyAE4RIwLLjk6Arc30WanS4h3nU3Rh9FF67rku01O1YXYhVhd1ram2ZVu-Q2jqz9Sfuhq2qQp3QWrp252NbePvAGvvJ6Ke7s7j8HXq0_ry-t-dfv55vJi1VtKWO29pRhBwTSWkEBvvNdCaIkGRoyTXGAutUF4HPVAKMEOQiKhRGY0XlBmOD4G77d92zcfF1eqmkOxbpp0dGkpigmOByz_D5EkDMlhaPDsH3iflhzbEM20bkIi1hDdIptTKdl5tclh1vmXQlA956L-zkVtc2l1p7vmi5nd-FK1C6KBdzugS1unzzraUF6caAug9Nn1WxdKdT__vOv8oBjHnKrrb9_b8F_W4uN6pRB-AnvZqzI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>198738916</pqid></control><display><type>article</type><title>Increasing confidence of protein interactomes using network topological metrics</title><source>Oxford University Press Open Access</source><creator>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</creator><creatorcontrib>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</creatorcontrib><description>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btl335</identifier><identifier>PMID: 16787971</identifier><identifier>CODEN: BOINFP</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Animals ; Biological and medical sciences ; Caenorhabditis elegans ; Computational Biology - methods ; Computer Simulation ; Databases, Protein ; Drosophila ; False Positive Reactions ; Fundamental and applied biological sciences. Psychology ; General aspects ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Protein Binding ; Protein Interaction Mapping ; Proteomics - methods ; Saccharomyces cerevisiae - metabolism ; Sequence Analysis, Protein ; Software ; Two-Hybrid System Techniques</subject><ispartof>Bioinformatics, 2006-08, Vol.22 (16), p.1998-2004</ispartof><rights>2006 INIST-CNRS</rights><rights>Copyright Oxford University Press(England) Aug 15, 2006</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</citedby><cites>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=18049551$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16787971$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Chen, Jin</creatorcontrib><creatorcontrib>Hsu, Wynne</creatorcontrib><creatorcontrib>Lee, Mong Li</creatorcontrib><creatorcontrib>Ng, See-Kiong</creatorcontrib><title>Increasing confidence of protein interactomes using network topological metrics</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</description><subject>Animals</subject><subject>Biological and medical sciences</subject><subject>Caenorhabditis elegans</subject><subject>Computational Biology - methods</subject><subject>Computer Simulation</subject><subject>Databases, Protein</subject><subject>Drosophila</subject><subject>False Positive Reactions</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Protein Binding</subject><subject>Protein Interaction Mapping</subject><subject>Proteomics - methods</subject><subject>Saccharomyces cerevisiae - metabolism</subject><subject>Sequence Analysis, Protein</subject><subject>Software</subject><subject>Two-Hybrid System Techniques</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><recordid>eNqF0U1LHTEUBuAgFrW2P0EZhHY3NZl8L63UKlzq5hZKNyHJJBKdSa5Jhrb_3th7qbSbrhLIcw455wXgBMEPCEp8bkIK0ac86xpsOTd1wpjugSNEGOwHSOV-u2PGeyIgPgSvS7mHkCJCyAE4RIwLLjk6Arc30WanS4h3nU3Rh9FF67rku01O1YXYhVhd1ram2ZVu-Q2jqz9Sfuhq2qQp3QWrp252NbePvAGvvJ6Ke7s7j8HXq0_ry-t-dfv55vJi1VtKWO29pRhBwTSWkEBvvNdCaIkGRoyTXGAutUF4HPVAKMEOQiKhRGY0XlBmOD4G77d92zcfF1eqmkOxbpp0dGkpigmOByz_D5EkDMlhaPDsH3iflhzbEM20bkIi1hDdIptTKdl5tclh1vmXQlA956L-zkVtc2l1p7vmi5nd-FK1C6KBdzugS1unzzraUF6caAug9Nn1WxdKdT__vOv8oBjHnKrrb9_b8F_W4uN6pRB-AnvZqzI</recordid><startdate>20060815</startdate><enddate>20060815</enddate><creator>Chen, Jin</creator><creator>Hsu, Wynne</creator><creator>Lee, Mong Li</creator><creator>Ng, See-Kiong</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7TO</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>7SS</scope><scope>7X8</scope></search><sort><creationdate>20060815</creationdate><title>Increasing confidence of protein interactomes using network topological metrics</title><author>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Animals</topic><topic>Biological and medical sciences</topic><topic>Caenorhabditis elegans</topic><topic>Computational Biology - methods</topic><topic>Computer Simulation</topic><topic>Databases, Protein</topic><topic>Drosophila</topic><topic>False Positive Reactions</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Protein Binding</topic><topic>Protein Interaction Mapping</topic><topic>Proteomics - methods</topic><topic>Saccharomyces cerevisiae - metabolism</topic><topic>Sequence Analysis, Protein</topic><topic>Software</topic><topic>Two-Hybrid System Techniques</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Jin</creatorcontrib><creatorcontrib>Hsu, Wynne</creatorcontrib><creatorcontrib>Lee, Mong Li</creatorcontrib><creatorcontrib>Ng, See-Kiong</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Jin</au><au>Hsu, Wynne</au><au>Lee, Mong Li</au><au>Ng, See-Kiong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Increasing confidence of protein interactomes using network topological metrics</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2006-08-15</date><risdate>2006</risdate><volume>22</volume><issue>16</issue><spage>1998</spage><epage>2004</epage><pages>1998-2004</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><coden>BOINFP</coden><abstract>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>16787971</pmid><doi>10.1093/bioinformatics/btl335</doi><tpages>7</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2006-08, Vol.22 (16), p.1998-2004
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_proquest_miscellaneous_68732397
source Oxford University Press Open Access
subjects Animals
Biological and medical sciences
Caenorhabditis elegans
Computational Biology - methods
Computer Simulation
Databases, Protein
Drosophila
False Positive Reactions
Fundamental and applied biological sciences. Psychology
General aspects
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Protein Binding
Protein Interaction Mapping
Proteomics - methods
Saccharomyces cerevisiae - metabolism
Sequence Analysis, Protein
Software
Two-Hybrid System Techniques
title Increasing confidence of protein interactomes using network topological metrics
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T16%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Increasing%20confidence%20of%20protein%20interactomes%20using%20network%20topological%20metrics&rft.jtitle=Bioinformatics&rft.au=Chen,%20Jin&rft.date=2006-08-15&rft.volume=22&rft.issue=16&rft.spage=1998&rft.epage=2004&rft.pages=1998-2004&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/btl335&rft_dat=%3Cproquest_cross%3E1107233321%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=198738916&rft_id=info:pmid/16787971&rfr_iscdi=true