Extracting Domain-specific Concepts from Large-scale Linked Open Data
Satoshi Kume, Kouji Kozaki

TL;DR
This paper introduces a methodology for extracting domain-specific concepts from large-scale linked open data, aiding the construction of specialized ontologies with minimal manual effort.
Contribution
It presents a novel approach that links LOD vocabulary with domain terms to automatically generate initial domain ontologies.
Findings
Successfully extracted a polymer materials ontology from Wikidata
Demonstrated the method's applicability to general class hierarchy datasets
Evaluated concept coverage using NLP and technical dictionaries
Abstract
We propose a methodology for extracting concepts for a target domain from large-scale linked open data (LOD) to support the construction of domain ontologies providing field-specific knowledge and definitions. The proposed method defines search entities by linking the LOD vocabulary with technical terms related to the target domain. The search entities are then used as a starting point for obtaining upper-level concepts in the LOD, and the occurrences of common upper-level entities and the chain-of-path relationships are examined to determine the range of conceptual connections in the target domain. A technical dictionary index and natural language processing are used to evaluate whether the extracted concepts cover the domain. As an example of extracting a class hierarchy from LOD, we used Wikidata to construct a domain ontology for polymer materials and physical properties. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Graph Neural Networks · Wikis in Education and Collaboration
