TL;DR
This paper investigates the presence of degree bias in embedding-based knowledge graph completion and introduces KG-Mixup, a data augmentation technique, to mitigate this bias and improve performance.
Contribution
It is the first to validate degree bias in embedding-based KGC and proposes a novel augmentation method to address it.
Findings
KG-Mixup improves KGC accuracy across datasets
Mitigates degree bias for low-degree nodes
Outperforms existing bias mitigation methods
Abstract
A fundamental task for knowledge graphs (KGs) is knowledge graph completion (KGC). It aims to predict unseen edges by learning representations for all the entities and relations in a KG. A common concern when learning representations on traditional graphs is degree bias. It can affect graph algorithms by learning poor representations for lower-degree nodes, often leading to low performance on such nodes. However, there has been limited research on whether there exists degree bias for embedding-based KGC and how such bias affects the performance of KGC. In this paper, we validate the existence of degree bias in embedding-based KGC and identify the key factor to degree bias. We then introduce a novel data augmentation method, KG-Mixup, to generate synthetic triples to mitigate such bias. Extensive experiments have demonstrated that our method can improve various embedding-based KGC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
