Exploring PSI-MI XML Collections Using DescribeX
PSI-MI has been endorsed by the protein informatics community as a standard XML data exchange format for protein-protein interaction datasets. While many public databases support the standard, there is a degree of heterogeneity in the way the proposed XML schema is interpreted and instantiated by di...
Published in: | Journal of integrative bioinformatics 2007-12, Vol.4 (3), p.123-134 |
---|---|
Main Authors: | Samavi, Reza; Consens, Mariano; Khatchadourian, Shahan; Topaloglou, Thodoros |
Format: | Article |
Language: | English |
cited_by | |
---|---|
cites | |
container_end_page | 134 |
container_issue | 3 |
container_start_page | 123 |
container_title | Journal of integrative bioinformatics |
container_volume | 4 |
creator | Samavi, Reza Consens, Mariano Khatchadourian, Shahan Topaloglou, Thodoros |
description | PSI-MI has been endorsed by the protein informatics community as a standard XML data exchange format for protein-protein interaction datasets. While many public databases support the standard, there is a degree of heterogeneity in the way the proposed XML schema is interpreted and instantiated by different data providers. Analysis of schema instantiation in large collections of XML data is a challenging task that is unsupported by existing tools.
In this study we use DescribeX, a novel visualization technique of (semi-)structured XML formats, to quantitatively and qualitatively analyze PSI-MI XML collections at the instance level with the goal of gaining insights about schema usage and to study specific questions such as: adequacy of controlled vocabularies, detection of common instance patterns, and evolution of different data collections. Our analysis shows DescribeX enhances understanding the instance-level structure of PSI-MI data sources and is a useful tool for standards designers, software developers, and PSI-MI data providers. |
doi_str_mv | 10.2390/biecoll-jib-2007-70 |
format | article |
publisher | IMBio e.V |
date | 2007-12-01 |
tpages | 12 |
toplevel | peer_reviewed, online_resources |
oa | free_for_read |
linktopdf | https://www.degruyter.com/document/doi/10.2390/biecoll-jib-2007-70/pdf |
linktohtml | https://www.degruyter.com/document/doi/10.2390/biecoll-jib-2007-70/html |
fulltext | fulltext |
identifier | EISSN: 1613-4516 |
ispartof | Journal of integrative bioinformatics, 2007-12, Vol.4 (3), p.123-134 |
issn | 1613-4516 |
language | eng |
recordid | cdi_walterdegruyter_journals_10_2390_biecoll_jib_2007_7043123 |
source | Walter De Gruyter: Open Access Journals |
title | Exploring PSI-MI XML Collections Using DescribeX |