What is Learned in Knowledge Graph Embeddings?
Michael R. Douglas, Michael Simkin, Omri Ben-Eliezer, Tianqi Wu, Peter, Chin, Trung V. Dang, Andrew Wood

TL;DR
This paper investigates what knowledge graph embedding models truly learn, distinguishing between motif learning, network connectivity, and statistical patterns, and evaluates their contributions to model performance.
Contribution
It introduces tests to differentiate the mechanisms behind KG embedding success and applies them to benchmark datasets, challenging assumptions about rule learning.
Findings
KG models can learn motifs but are affected by noise edges
Performance is influenced by multiple mechanisms, not just rule learning
Proposes improved evaluation protocols for KG models
Abstract
A knowledge graph (KG) is a data structure which represents entities and relations as the vertices and edges of a directed graph with edge types. KGs are an important primitive in modern machine learning and artificial intelligence. Embedding-based models, such as the seminal TransE [Bordes et al., 2013] and the recent PairRE [Chao et al., 2020] are among the most popular and successful approaches for representing KGs and inferring missing edges (link completion). Their relative success is often credited in the literature to their ability to learn logical rules between the relations. In this work, we investigate whether learning rules between relations is indeed what drives the performance of embedding-based methods. We define motif learning and two alternative mechanisms, network learning (based only on the connectivity of the KG, ignoring the relation types), and unstructured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTransE
