Extracting Domain-specific Concepts from Large-scale Linked Open Data

Satoshi Kume; Kouji Kozaki

arXiv:2112.03102·cs.IR·January 31, 2022

Extracting Domain-specific Concepts from Large-scale Linked Open Data

Satoshi Kume, Kouji Kozaki

PDF

Open Access 1 Repo

TL;DR

This paper introduces a methodology for extracting domain-specific concepts from large-scale linked open data, aiding the construction of specialized ontologies with minimal manual effort.

Contribution

It presents a novel approach that links LOD vocabulary with domain terms to automatically generate initial domain ontologies.

Findings

01

Successfully extracted a polymer materials ontology from Wikidata

02

Demonstrated the method's applicability to general class hierarchy datasets

03

Evaluated concept coverage using NLP and technical dictionaries

Abstract

We propose a methodology for extracting concepts for a target domain from large-scale linked open data (LOD) to support the construction of domain ontologies providing field-specific knowledge and definitions. The proposed method defines search entities by linking the LOD vocabulary with technical terms related to the target domain. The search entities are then used as a starting point for obtaining upper-level concepts in the LOD, and the occurrences of common upper-level entities and the chain-of-path relationships are examined to determine the range of conceptual connections in the target domain. A technical dictionary index and natural language processing are used to evaluate whether the extracted concepts cover the domain. As an example of extracting a class hierarchy from LOD, we used Wikidata to construct a domain ontology for polymer materials and physical properties. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kumes/aggraphsearch
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Advanced Graph Neural Networks · Wikis in Education and Collaboration