Loading…
Increasing confidence of protein interactomes using network topological metrics
Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-thr...
Saved in:
Published in: | Bioinformatics 2006-08, Vol.22 (16), p.1998-2004 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73 |
---|---|
cites | cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73 |
container_end_page | 2004 |
container_issue | 16 |
container_start_page | 1998 |
container_title | Bioinformatics |
container_volume | 22 |
creator | Chen, Jin Hsu, Wynne Lee, Mong Li Ng, See-Kiong |
description | Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online. |
doi_str_mv | 10.1093/bioinformatics/btl335 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_68732397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1107233321</sourcerecordid><originalsourceid>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</originalsourceid><addsrcrecordid>eNqF0U1LHTEUBuAgFrW2P0EZhHY3NZl8L63UKlzq5hZKNyHJJBKdSa5Jhrb_3th7qbSbrhLIcw455wXgBMEPCEp8bkIK0ac86xpsOTd1wpjugSNEGOwHSOV-u2PGeyIgPgSvS7mHkCJCyAE4RIwLLjk6Arc30WanS4h3nU3Rh9FF67rku01O1YXYhVhd1ram2ZVu-Q2jqz9Sfuhq2qQp3QWrp252NbePvAGvvJ6Ke7s7j8HXq0_ry-t-dfv55vJi1VtKWO29pRhBwTSWkEBvvNdCaIkGRoyTXGAutUF4HPVAKMEOQiKhRGY0XlBmOD4G77d92zcfF1eqmkOxbpp0dGkpigmOByz_D5EkDMlhaPDsH3iflhzbEM20bkIi1hDdIptTKdl5tclh1vmXQlA956L-zkVtc2l1p7vmi5nd-FK1C6KBdzugS1unzzraUF6caAug9Nn1WxdKdT__vOv8oBjHnKrrb9_b8F_W4uN6pRB-AnvZqzI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>198738916</pqid></control><display><type>article</type><title>Increasing confidence of protein interactomes using network topological metrics</title><source>Oxford University Press Open Access</source><creator>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</creator><creatorcontrib>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</creatorcontrib><description>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btl335</identifier><identifier>PMID: 16787971</identifier><identifier>CODEN: BOINFP</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Animals ; Biological and medical sciences ; Caenorhabditis elegans ; Computational Biology - methods ; Computer Simulation ; Databases, Protein ; Drosophila ; False Positive Reactions ; Fundamental and applied biological sciences. Psychology ; General aspects ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Protein Binding ; Protein Interaction Mapping ; Proteomics - methods ; Saccharomyces cerevisiae - metabolism ; Sequence Analysis, Protein ; Software ; Two-Hybrid System Techniques</subject><ispartof>Bioinformatics, 2006-08, Vol.22 (16), p.1998-2004</ispartof><rights>2006 INIST-CNRS</rights><rights>Copyright Oxford University Press(England) Aug 15, 2006</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</citedby><cites>FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=18049551$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16787971$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Chen, Jin</creatorcontrib><creatorcontrib>Hsu, Wynne</creatorcontrib><creatorcontrib>Lee, Mong Li</creatorcontrib><creatorcontrib>Ng, See-Kiong</creatorcontrib><title>Increasing confidence of protein interactomes using network topological metrics</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</description><subject>Animals</subject><subject>Biological and medical sciences</subject><subject>Caenorhabditis elegans</subject><subject>Computational Biology - methods</subject><subject>Computer Simulation</subject><subject>Databases, Protein</subject><subject>Drosophila</subject><subject>False Positive Reactions</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Protein Binding</subject><subject>Protein Interaction Mapping</subject><subject>Proteomics - methods</subject><subject>Saccharomyces cerevisiae - metabolism</subject><subject>Sequence Analysis, Protein</subject><subject>Software</subject><subject>Two-Hybrid System Techniques</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><recordid>eNqF0U1LHTEUBuAgFrW2P0EZhHY3NZl8L63UKlzq5hZKNyHJJBKdSa5Jhrb_3th7qbSbrhLIcw455wXgBMEPCEp8bkIK0ac86xpsOTd1wpjugSNEGOwHSOV-u2PGeyIgPgSvS7mHkCJCyAE4RIwLLjk6Arc30WanS4h3nU3Rh9FF67rku01O1YXYhVhd1ram2ZVu-Q2jqz9Sfuhq2qQp3QWrp252NbePvAGvvJ6Ke7s7j8HXq0_ry-t-dfv55vJi1VtKWO29pRhBwTSWkEBvvNdCaIkGRoyTXGAutUF4HPVAKMEOQiKhRGY0XlBmOD4G77d92zcfF1eqmkOxbpp0dGkpigmOByz_D5EkDMlhaPDsH3iflhzbEM20bkIi1hDdIptTKdl5tclh1vmXQlA956L-zkVtc2l1p7vmi5nd-FK1C6KBdzugS1unzzraUF6caAug9Nn1WxdKdT__vOv8oBjHnKrrb9_b8F_W4uN6pRB-AnvZqzI</recordid><startdate>20060815</startdate><enddate>20060815</enddate><creator>Chen, Jin</creator><creator>Hsu, Wynne</creator><creator>Lee, Mong Li</creator><creator>Ng, See-Kiong</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7TO</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>7SS</scope><scope>7X8</scope></search><sort><creationdate>20060815</creationdate><title>Increasing confidence of protein interactomes using network topological metrics</title><author>Chen, Jin ; Hsu, Wynne ; Lee, Mong Li ; Ng, See-Kiong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Animals</topic><topic>Biological and medical sciences</topic><topic>Caenorhabditis elegans</topic><topic>Computational Biology - methods</topic><topic>Computer Simulation</topic><topic>Databases, Protein</topic><topic>Drosophila</topic><topic>False Positive Reactions</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Protein Binding</topic><topic>Protein Interaction Mapping</topic><topic>Proteomics - methods</topic><topic>Saccharomyces cerevisiae - metabolism</topic><topic>Sequence Analysis, Protein</topic><topic>Software</topic><topic>Two-Hybrid System Techniques</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Jin</creatorcontrib><creatorcontrib>Hsu, Wynne</creatorcontrib><creatorcontrib>Lee, Mong Li</creatorcontrib><creatorcontrib>Ng, See-Kiong</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Jin</au><au>Hsu, Wynne</au><au>Lee, Mong Li</au><au>Ng, See-Kiong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Increasing confidence of protein interactomes using network topological metrics</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2006-08-15</date><risdate>2006</risdate><volume>22</volume><issue>16</issue><spage>1998</spage><epage>2004</epage><pages>1998-2004</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><coden>BOINFP</coden><abstract>Motivation: Experimental limitations in high-throughput protein–protein interaction detection methods have resulted in low quality interaction datasets that contained sizable fractions of false positives and false negatives. Small-scale, focused experiments are then needed to complement the high-throughput methods to extract true protein interactions. However, the naturally vast interactomes would require much more scalable approaches. Results: We describe a novel method called IRAP* as a computational complement for repurification of the highly erroneous experimentally derived protein interactomes. Our method involves an iterative process of removing interactions that are confidently identified as false positives and adding interactions detected as false negatives into the interactomes. Identification of both false positives and false negatives are performed in IRAP* using interaction confidence measures based on network topological metrics. Potential false positives are identified amongst the detected interactions as those with very low computed confidence values, while potential false negatives are discovered as the undetected interactions with high computed confidence values. Our results from applying IRAP* on large-scale interaction datasets generated by the popular yeast-two-hybrid assays for yeast, fruit fly and worm showed that the computationally repurified interaction datasets contained potentially lower fractions of false positive and false negative errors based on functional homogeneity. Availability: The confidence indices for PPIs in yeast, fruit fly and worm as computed by our method can be found at our website Contact:skng@i2r.a-star.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>16787971</pmid><doi>10.1093/bioinformatics/btl335</doi><tpages>7</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2006-08, Vol.22 (16), p.1998-2004 |
issn | 1367-4803 1460-2059 1367-4811 |
language | eng |
recordid | cdi_proquest_miscellaneous_68732397 |
source | Oxford University Press Open Access |
subjects | Animals Biological and medical sciences Caenorhabditis elegans Computational Biology - methods Computer Simulation Databases, Protein Drosophila False Positive Reactions Fundamental and applied biological sciences. Psychology General aspects Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Protein Binding Protein Interaction Mapping Proteomics - methods Saccharomyces cerevisiae - metabolism Sequence Analysis, Protein Software Two-Hybrid System Techniques |
title | Increasing confidence of protein interactomes using network topological metrics |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T16%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Increasing%20confidence%20of%20protein%20interactomes%20using%20network%20topological%20metrics&rft.jtitle=Bioinformatics&rft.au=Chen,%20Jin&rft.date=2006-08-15&rft.volume=22&rft.issue=16&rft.spage=1998&rft.epage=2004&rft.pages=1998-2004&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/btl335&rft_dat=%3Cproquest_cross%3E1107233321%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c546t-fc531086a39040fbffa88a91264be978379ab13dda24543e0049091bdbf856b73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=198738916&rft_id=info:pmid/16787971&rfr_iscdi=true |