Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains

Bibliographic Details
Published in:Journal of biomedical informatics 2014-10, Vol.51, p.254-271
Main Authors: Walk, Simon, Singer, Philipp, Strohmaier, Markus, Tudorache, Tania, Musen, Mark A., Noy, Natalya F.
Format: Article
Language:English
Description
Summary:
• We model usage patterns of five different ontology-engineering projects.
• Users work in micro-workflows and specific user-roles can be identified.
• Class hierarchy influences users’ edit behavior.
• Users edit ontologies top-down, breadth-first and prefer closely related classes.
• Users perform property-based workflows.

Biomedical taxonomies, thesauri and ontologies, such as the International Classification of Diseases (a taxonomy) or the National Cancer Institute Thesaurus (an OWL-based ontology), play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the International Classification of Diseases, which is currently under active development by the World Health Organization, contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This growth in size was accompanied by a change in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding the way these different stakeholders collaborate will enable us to improve editing environments that support such collaborations. In this paper, we use Markov chains to analyze the usage logs of five biomedical ontology-engineering projects of varying sizes and scopes, and so uncover how large ontology-engineering projects, such as the 11th revision of the International Classification of Diseases, unfold.
We discover intriguing interaction patterns (e.g., which properties users frequently change after specific given ones) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.
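As an illustration of the kind of analysis the summary describes, the following is a minimal sketch of estimating first-order Markov-chain transition probabilities from edit sequences. It is not the authors' implementation: the function name `transition_probabilities`, the session format and the property names are assumptions made purely for this example.

```python
from collections import Counter, defaultdict


def transition_probabilities(sessions):
    """Estimate first-order transition probabilities by counting
    how often one edited property is followed by another."""
    counts = defaultdict(Counter)
    for session in sessions:
        # Pair each action with the action that follows it in the same session.
        for current, following in zip(session, session[1:]):
            counts[current][following] += 1
    # Normalize each row of counts so that it sums to 1.
    return {
        state: {nxt: n / sum(followers.values()) for nxt, n in followers.items()}
        for state, followers in counts.items()
    }


# Hypothetical usage logs: each list is the ordered sequence of
# properties one user changed during one editing session.
sessions = [
    ["title", "definition", "synonym"],
    ["title", "definition", "definition"],
    ["title", "synonym"],
]

probs = transition_probabilities(sessions)
for state, followers in probs.items():
    print(state, followers)
```

Each row of the resulting table answers the question raised in the summary, namely which property a user is likely to change after a given one. States that never have a successor in the logs (here `synonym`) simply have no row.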
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2014.06.004