TL;DR
This paper introduces CAGED, a novel unsupervised framework that uses contrastive learning and hyper-view augmentation to improve the detection of errors in knowledge graphs, outperforming existing methods.
Contribution
The paper proposes a new contrastive learning-based approach for KG error detection that does not rely on negative sampling or domain-specific rules.
Findings
CAGED outperforms state-of-the-art methods on three real-world KGs.
It effectively models KG with hyper-view augmentation and contrastive learning.
The framework demonstrates robustness without requiring labeled data.
Abstract
Knowledge Graph (KG) errors introduce non-negligible noise, severely affecting KG-related downstream tasks. Detecting errors in KGs is challenging since the patterns of errors are unknown and diverse, while ground-truth labels are rare or even unavailable. A traditional solution is to construct logical rules to verify triples, but it is not generalizable since different KGs have distinct rules with domain knowledge involved. Recent studies focus on designing tailored detectors or ranking triples based on KG embedding loss. However, they all rely on negative samples for training, which are generated by randomly replacing the head or tail entity of existing triples. Such a negative sampling strategy is not enough for prototyping practical KG errors, e.g., (Bruce_Lee, place_of_birth, China), in which the three elements are often relevant, although mismatched. We desire a more effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
