Analysis of the Impact of Negative Sampling on Link Prediction in   Knowledge Graphs

Bhushan Kotnis; Vivi Nastase

arXiv:1708.06816·cs.AI·March 5, 2018·63 cites

Analysis of the Impact of Negative Sampling on Link Prediction in Knowledge Graphs

Bhushan Kotnis, Vivi Nastase

PDF

Open Access 1 Repo

TL;DR

This paper empirically investigates how different negative sampling strategies affect the quality of knowledge graph embeddings for link prediction, revealing dataset-dependent impacts of sampling methods.

Contribution

It provides a comprehensive comparison of negative sampling techniques, including novel embedding-based methods, across multiple embedding models and benchmark datasets.

Findings

01

Traditional corrupting positives works best on WN18

02

Embedding-based sampling improves results on FB15k

03

Sampling method impact varies significantly between datasets

Abstract

Knowledge graphs are large, useful, but incomplete knowledge repositories. They encode knowledge through entities and relations which define each other through the connective structure of the graph. This has inspired methods for the joint embedding of entities and relations in continuous low-dimensional vector spaces, that can be used to induce new edges in the graph, i.e., link prediction in knowledge graphs. Learning these representations relies on contrasting positive instances with negative ones. Knowledge graphs include only positive relation instances, leaving the door open for a variety of methods for selecting negative examples. In this paper we present an empirical study on the impact of negative sampling on the learned embeddings, assessed through the task of link prediction. We use state-of-the-art knowledge graph embeddings -- \rescal , TransE, DistMult and ComplEX -- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bhushank/kge-rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Data Quality and Management

MethodsTransE