SLHCat: Mapping Wikipedia Categories and Lists to DBpedia by Leveraging Semantic, Lexical, and Hierarchical Features
Zhaoyi Wang, Zhenyang Zhang, Jiaxin Qin, Mizuho Iwaihara

TL;DR
SLHCat is a novel method that leverages structural, lexical, and semantic features to improve Wikipedia category and list mapping to DBpedia, enhancing ontology alignment accuracy.
Contribution
The paper introduces SLHCat, a new approach combining knowledge graph structure and language model fine-tuning for more accurate ontology mapping.
Findings
SLHCat outperforms baseline by 25% in accuracy.
Effective use of semantic and lexical features improves mapping quality.
Method enables large-scale, fine-grained ontology alignment.
Abstract
Wikipedia articles are hierarchically organized through categories and lists, providing one of the most comprehensive and universal taxonomy, but its open creation is causing redundancies and inconsistencies. Assigning DBPedia classes to Wikipedia categories and lists can alleviate the problem, realizing a large knowledge graph which is essential for categorizing digital contents through entity linking and typing. However, the existing approach of CaLiGraph is producing incomplete and non-fine grained mappings. In this paper, we tackle the problem as ontology alignment, where structural information of knowledge graphs and lexical and semantic features of ontology class names are utilized to discover confident mappings, which are in turn utilized for finetuing pretrained language models in a distant supervision fashion. Our method SLHCat consists of two main parts: 1) Automatically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Wikis in Education and Collaboration
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Weight Decay · Dropout · Linear Layer · Layer Normalization · WordPiece · Linear Warmup With Linear Decay · Softmax
