Open Knowledge Enrichment for Long-tail Entities
Ermei Cao, Difeng Wang, Jiacheng Huang, Wei Hu

TL;DR
This paper presents a comprehensive method for enriching knowledge bases by predicting missing properties and inferring facts for long-tail entities using open Web data, leveraging prior knowledge from popular entities.
Contribution
It introduces a full-fledged approach specifically designed for long-tail entities, addressing gaps left by existing methods that focus only on link completion or value filling.
Findings
Demonstrates the approach's effectiveness on synthetic and real-world datasets.
Shows superiority over related methods in enriching long-tail entities.
Validates the feasibility of using open Web data for knowledge enrichment.
Abstract
Knowledge bases (KBs) have gradually become a valuable asset for many AI applications. While many current KBs are quite large, they are widely acknowledged as incomplete, especially lacking facts of long-tail entities, e.g., less famous persons. Existing approaches enrich KBs mainly on completing missing links or filling missing values. However, they only tackle a part of the enrichment problem and lack specific considerations regarding long-tail entities. In this paper, we propose a full-fledged approach to knowledge enrichment, which predicts missing properties and infers true facts of long-tail entities from the open Web. Prior knowledge from popular entities is leveraged to improve every enrichment step. Our experiments on the synthetic and real-world datasets and comparison with related work demonstrate the feasibility and superiority of the approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Data Quality and Management · Advanced Graph Neural Networks
