On Evaluating Embedding Models for Knowledge Base Completion
Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit,, Christian Meilicke

TL;DR
This paper critically examines the effectiveness of current embedding models for knowledge base completion, revealing that they underperform under more appropriate evaluation protocols and highlighting the need for improved models and assessment methods.
Contribution
It challenges existing evaluation practices, demonstrating that current models are inadequate for knowledge base completion when assessed with better-suited metrics.
Findings
Current models perform poorly on revised evaluation tasks.
Simple rule-based baselines outperform many embedding models.
Existing evaluation protocols favor question answering over knowledge base completion.
Abstract
Knowledge bases contribute to many web search and mining tasks, yet they are often incomplete. To add missing facts to a given knowledge base, various embedding models have been proposed in the recent literature. Perhaps surprisingly, relatively simple models with limited expressiveness often performed remarkably well under today's most commonly used evaluation protocols. In this paper, we explore whether recent models work well for knowledge base completion and argue that the current evaluation protocols are more suited for question answering rather than knowledge base completion. We show that when focusing on a different prediction task for evaluating knowledge base completion, the performance of current embedding models is unsatisfactory even on datasets previously thought to be too easy. This is especially true when embedding models are compared against a simple rule-based baseline.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications
