Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains
Simon Walk, Philipp Singer, Markus Strohmaier, Tania Tudorache, Mark, A. Musen, Natalya F. Noy

TL;DR
This paper analyzes large biomedical ontology-engineering projects using Markov chains to uncover interaction patterns and principles that govern collaborative development, providing insights for improving collaborative tools and processes.
Contribution
It introduces a novel application of Markov chains to analyze collaboration patterns in large-scale biomedical ontology projects, revealing common principles and differences across projects.
Findings
Interaction patterns in ontology editing were identified.
Large projects are governed by general principles.
Differences between projects inform tool and process improvements.
Abstract
Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases (ICD) as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the ICD, which is currently under active development by the WHO contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
