Domain and Range Aware Synthetic Negatives Generation for Knowledge Graph Embedding Models
Alberto Bernardi, Luca Costabello

TL;DR
This paper introduces a domain and range aware method for generating synthetic negative samples in knowledge graph embedding models, significantly improving their performance on benchmark datasets.
Contribution
It extends existing negative sampling strategies by incorporating domain and range constraints, leading to substantial performance gains.
Findings
+10% MRR improvement on standard datasets
+150% MRR improvement on ontology-backed dataset
Enhanced negative sampling method improves embedding quality
Abstract
Knowledge Graph Embedding models, representing entities and edges in a low-dimensional space, have been extremely successful at solving tasks related to completing and exploring Knowledge Graphs (KGs). One of the key aspects of training most of these models is teaching to discriminate between true statements positives and false ones (negatives). However, the way in which negatives can be defined is not trivial, as facts missing from the KG are not necessarily false and a set of ground truth negatives is hardly ever given. This makes synthetic negative generation a necessity. Different generation strategies can heavily affect the quality of the embeddings, making it a primary aspect to consider. We revamp a strategy that generates corruptions during training respecting the domain and range of relations, we extend its capabilities and we show our methods bring substantial improvement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Artificial Intelligence in Healthcare
MethodsSparse Evolutionary Training
