Loading…

Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains

[Display omitted] •We model usage patterns of five different ontology-engineering projects.•Users work in micro-workflows and specific user-roles can be identified.•Class hierarchy influences users’ edit behavior.•Users edit ontologies top-down, breadth-first and prefer closely related classes.•User...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2014-10, Vol.51, p.254-271
Main Authors: Walk, Simon, Singer, Philipp, Strohmaier, Markus, Tudorache, Tania, Musen, Mark A., Noy, Natalya F.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63
cites cdi_FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63
container_end_page 271
container_issue
container_start_page 254
container_title Journal of biomedical informatics
container_volume 51
creator Walk, Simon
Singer, Philipp
Strohmaier, Markus
Tudorache, Tania
Musen, Mark A.
Noy, Natalya F.
description [Display omitted] •We model usage patterns of five different ontology-engineering projects.•Users work in micro-workflows and specific user-roles can be identified.•Class hierarchy influences users’ edit behavior.•Users edit ontologies top-down, breadth-first and prefer closely related classes.•Users perform property-based workflows. Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the International Classification of Diseases, which is currently under active development by the World Health Organization contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding the way these different stakeholders collaborate will enable us to improve editing environments that support such collaborations. In this paper, we uncover how large ontology-engineering projects, such as the International Classification of Diseases in its 11th revision, unfold by analyzing usage logs of five different biomedical ontology-engineering projects of varying sizes and scopes using Markov chains. We discover intriguing interaction patterns (e.g., which properties users frequently change after specific given ones) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.
doi_str_mv 10.1016/j.jbi.2014.06.004
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4194274</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046414001427</els_id><sourcerecordid>1612287122</sourcerecordid><originalsourceid>FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63</originalsourceid><addsrcrecordid>eNp9Uctu2zAQJIoGder0A3IJdOxFCpeiXihQoHGeQAL7kJxyIChqZVOVyYSUBfjvS8Ou0V4CAiQXOzNczhByDjQBCvlll3S1ThgFntA8oZR_IqeQpSymvKSfj_ecT8hX7ztKAbIs_0ImjFehxdkpeb3WXtkRnTbL6ArlgCZayGHlI22ime17WVsnBz1iNDeD7e1yG9-YpTa4pyyc7VANPtr4Xfkk3W87RrOV1MafkZNW9h6_Hc4pebm9eZ7dx4_zu4fZr8dYZQyGmIMseF2VmLGW1ZksZFsALdMUy5KqkkILvGnqYrdSjnXW1E1RVbQCBqpq8nRKfu513zb1GhuFZnCyF29Or6XbCiu1-L9j9Eos7Sg4VJwVPAh8Pwg4-75BP4h1cAXD5w3ajReQA2NlEbYAhT1UOeu9w_b4DFCxC0V0IoQidqEImosQSuBc_DvfkfE3hQD4sQdgcGnU6IRXGo3CRrtgrmis_kD-D8UOnos</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1612287122</pqid></control><display><type>article</type><title>Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains</title><source>ScienceDirect Freedom Collection</source><creator>Walk, Simon ; Singer, Philipp ; Strohmaier, Markus ; Tudorache, Tania ; Musen, Mark A. ; Noy, Natalya F.</creator><creatorcontrib>Walk, Simon ; Singer, Philipp ; Strohmaier, Markus ; Tudorache, Tania ; Musen, Mark A. ; Noy, Natalya F.</creatorcontrib><description>[Display omitted] •We model usage patterns of five different ontology-engineering projects.•Users work in micro-workflows and specific user-roles can be identified.•Class hierarchy influences users’ edit behavior.•Users edit ontologies top-down, breadth-first and prefer closely related classes.•Users perform property-based workflows. Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the International Classification of Diseases, which is currently under active development by the World Health Organization contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding the way these different stakeholders collaborate will enable us to improve editing environments that support such collaborations. In this paper, we uncover how large ontology-engineering projects, such as the International Classification of Diseases in its 11th revision, unfold by analyzing usage logs of five different biomedical ontology-engineering projects of varying sizes and scopes using Markov chains. We discover intriguing interaction patterns (e.g., which properties users frequently change after specific given ones) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2014.06.004</identifier><identifier>PMID: 24953242</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Artificial Intelligence ; Biological Ontologies ; Collaboration ; Collaborative ontology engineering ; Computer Simulation ; Cooperative Behavior ; Data Interpretation, Statistical ; International Classification of Diseases - classification ; International Classification of Diseases - organization &amp; administration ; Internationality ; Markov Chains ; Models, Statistical ; Natural Language Processing ; Ontology-engineering tool ; Pattern Recognition, Automated - methods ; Semantics ; Sequential patterns ; User interface</subject><ispartof>Journal of biomedical informatics, 2014-10, Vol.51, p.254-271</ispartof><rights>2014 Elsevier Inc.</rights><rights>Copyright © 2014 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63</citedby><cites>FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24953242$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Walk, Simon</creatorcontrib><creatorcontrib>Singer, Philipp</creatorcontrib><creatorcontrib>Strohmaier, Markus</creatorcontrib><creatorcontrib>Tudorache, Tania</creatorcontrib><creatorcontrib>Musen, Mark A.</creatorcontrib><creatorcontrib>Noy, Natalya F.</creatorcontrib><title>Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted] •We model usage patterns of five different ontology-engineering projects.•Users work in micro-workflows and specific user-roles can be identified.•Class hierarchy influences users’ edit behavior.•Users edit ontologies top-down, breadth-first and prefer closely related classes.•Users perform property-based workflows. Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the International Classification of Diseases, which is currently under active development by the World Health Organization contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding the way these different stakeholders collaborate will enable us to improve editing environments that support such collaborations. In this paper, we uncover how large ontology-engineering projects, such as the International Classification of Diseases in its 11th revision, unfold by analyzing usage logs of five different biomedical ontology-engineering projects of varying sizes and scopes using Markov chains. We discover intriguing interaction patterns (e.g., which properties users frequently change after specific given ones) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.</description><subject>Artificial Intelligence</subject><subject>Biological Ontologies</subject><subject>Collaboration</subject><subject>Collaborative ontology engineering</subject><subject>Computer Simulation</subject><subject>Cooperative Behavior</subject><subject>Data Interpretation, Statistical</subject><subject>International Classification of Diseases - classification</subject><subject>International Classification of Diseases - organization &amp; administration</subject><subject>Internationality</subject><subject>Markov Chains</subject><subject>Models, Statistical</subject><subject>Natural Language Processing</subject><subject>Ontology-engineering tool</subject><subject>Pattern Recognition, Automated - methods</subject><subject>Semantics</subject><subject>Sequential patterns</subject><subject>User interface</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNp9Uctu2zAQJIoGder0A3IJdOxFCpeiXihQoHGeQAL7kJxyIChqZVOVyYSUBfjvS8Ou0V4CAiQXOzNczhByDjQBCvlll3S1ThgFntA8oZR_IqeQpSymvKSfj_ecT8hX7ztKAbIs_0ImjFehxdkpeb3WXtkRnTbL6ArlgCZayGHlI22ime17WVsnBz1iNDeD7e1yG9-YpTa4pyyc7VANPtr4Xfkk3W87RrOV1MafkZNW9h6_Hc4pebm9eZ7dx4_zu4fZr8dYZQyGmIMseF2VmLGW1ZksZFsALdMUy5KqkkILvGnqYrdSjnXW1E1RVbQCBqpq8nRKfu513zb1GhuFZnCyF29Or6XbCiu1-L9j9Eos7Sg4VJwVPAh8Pwg4-75BP4h1cAXD5w3ajReQA2NlEbYAhT1UOeu9w_b4DFCxC0V0IoQidqEImosQSuBc_DvfkfE3hQD4sQdgcGnU6IRXGo3CRrtgrmis_kD-D8UOnos</recordid><startdate>20141001</startdate><enddate>20141001</enddate><creator>Walk, Simon</creator><creator>Singer, Philipp</creator><creator>Strohmaier, Markus</creator><creator>Tudorache, Tania</creator><creator>Musen, Mark A.</creator><creator>Noy, Natalya F.</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20141001</creationdate><title>Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains</title><author>Walk, Simon ; Singer, Philipp ; Strohmaier, Markus ; Tudorache, Tania ; Musen, Mark A. ; Noy, Natalya F.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Artificial Intelligence</topic><topic>Biological Ontologies</topic><topic>Collaboration</topic><topic>Collaborative ontology engineering</topic><topic>Computer Simulation</topic><topic>Cooperative Behavior</topic><topic>Data Interpretation, Statistical</topic><topic>International Classification of Diseases - classification</topic><topic>International Classification of Diseases - organization &amp; administration</topic><topic>Internationality</topic><topic>Markov Chains</topic><topic>Models, Statistical</topic><topic>Natural Language Processing</topic><topic>Ontology-engineering tool</topic><topic>Pattern Recognition, Automated - methods</topic><topic>Semantics</topic><topic>Sequential patterns</topic><topic>User interface</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Walk, Simon</creatorcontrib><creatorcontrib>Singer, Philipp</creatorcontrib><creatorcontrib>Strohmaier, Markus</creatorcontrib><creatorcontrib>Tudorache, Tania</creatorcontrib><creatorcontrib>Musen, Mark A.</creatorcontrib><creatorcontrib>Noy, Natalya F.</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Walk, Simon</au><au>Singer, Philipp</au><au>Strohmaier, Markus</au><au>Tudorache, Tania</au><au>Musen, Mark A.</au><au>Noy, Natalya F.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2014-10-01</date><risdate>2014</risdate><volume>51</volume><spage>254</spage><epage>271</epage><pages>254-271</pages><issn>1532-0464</issn><eissn>1532-0480</eissn><abstract>[Display omitted] •We model usage patterns of five different ontology-engineering projects.•Users work in micro-workflows and specific user-roles can be identified.•Class hierarchy influences users’ edit behavior.•Users edit ontologies top-down, breadth-first and prefer closely related classes.•Users perform property-based workflows. Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the International Classification of Diseases, which is currently under active development by the World Health Organization contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding the way these different stakeholders collaborate will enable us to improve editing environments that support such collaborations. In this paper, we uncover how large ontology-engineering projects, such as the International Classification of Diseases in its 11th revision, unfold by analyzing usage logs of five different biomedical ontology-engineering projects of varying sizes and scopes using Markov chains. We discover intriguing interaction patterns (e.g., which properties users frequently change after specific given ones) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>24953242</pmid><doi>10.1016/j.jbi.2014.06.004</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1532-0464
ispartof Journal of biomedical informatics, 2014-10, Vol.51, p.254-271
issn 1532-0464
1532-0480
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4194274
source ScienceDirect Freedom Collection
subjects Artificial Intelligence
Biological Ontologies
Collaboration
Collaborative ontology engineering
Computer Simulation
Cooperative Behavior
Data Interpretation, Statistical
International Classification of Diseases - classification
International Classification of Diseases - organization & administration
Internationality
Markov Chains
Models, Statistical
Natural Language Processing
Ontology-engineering tool
Pattern Recognition, Automated - methods
Semantics
Sequential patterns
User interface
title Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T21%3A11%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Discovering%20Beaten%20Paths%20in%20Collaborative%20Ontology-Engineering%20Projects%20using%20Markov%20Chains&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Walk,%20Simon&rft.date=2014-10-01&rft.volume=51&rft.spage=254&rft.epage=271&rft.pages=254-271&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2014.06.004&rft_dat=%3Cproquest_pubme%3E1612287122%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c521t-41a74b98e52f2b5a7af710833e880c801f14ddb7b7b734eb5dbd79909121c9d63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1612287122&rft_id=info:pmid/24953242&rfr_iscdi=true