Towards Understanding the Evolution of Vocabulary Terms in Knowledge Graphs
Mohammad Abdel-Qader, Ansgar Scherp

TL;DR
This paper analyzes how vocabulary terms in knowledge graphs evolve over time, examining change frequency, adoption patterns, and the impact of vocabulary updates on large-scale datasets like Wikidata and the Billion Triples Challenge.
Contribution
It provides a comprehensive analysis of vocabulary change dynamics and adoption timelines in knowledge graphs, highlighting implications for data modeling and querying.
Findings
Change frequency of vocabulary terms is low but impactful.
Many deprecated terms are still in use by data publishers.
Adoption times vary from days to years, with some pre-adoption observations.
Abstract
Vocabularies are used for modeling data in Knowledge Graphs (KG) like the Linked Open Data Cloud and Wikidata. During their lifetime, the vocabularies of the KGs are subject to changes. New terms are coined, while existing terms are modified or declared as deprecated. We first quantify the amount and frequency of changes in vocabularies. Subsequently, we investigate to which extend and when the changes are adopted in the evolution of the KGs. We conduct our experiments on three large-scale KGs for which time-stamped snapshots are available, namely the Billion Triples Challenge datasets, Dynamic Linked Data Observatory dataset, and Wikidata. Our results show that the change frequency of terms is rather low, but can have high impact when adopted on a large amount of distributed graph data on the web. Furthermore, not all coined terms are used and most of the deprecated terms are still…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Advanced Graph Neural Networks
