Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation
Laurel Orr, Megan Leszczynski, Simran Arora, Sen Wu, Neel Guha, Xiao, Ling, Christopher Re

TL;DR
Bootleg is a self-supervised named entity disambiguation system that leverages reasoning patterns inspired by human disambiguation strategies, achieving state-of-the-art results and transferring well to related tasks.
Contribution
The paper introduces Bootleg, a novel self-supervised NED approach grounded in explicit reasoning patterns, with a training procedure that enhances disambiguation of tail entities.
Findings
Bootleg outperforms existing NED benchmarks.
Learned representations transfer effectively to other entity-based tasks.
Achieves new state-of-the-art in TACRED relation extraction.
Abstract
A challenge for named entity disambiguation (NED), the task of mapping textual mentions to entities in a knowledge base, is how to disambiguate entities that appear rarely in the training data, termed tail entities. Humans use subtle reasoning patterns based on knowledge of entity facts, relations, and types to disambiguate unfamiliar entities. Inspired by these patterns, we introduce Bootleg, a self-supervised NED system that is explicitly grounded in reasoning patterns for disambiguation. We define core reasoning patterns for disambiguation, create a learning procedure to encourage the self-supervised model to learn the patterns, and show how to use weak supervision to enhance the signals in the training data. Encoding the reasoning patterns in a simple Transformer architecture, Bootleg meets or exceeds state-of-the-art on three NED benchmarks. We further show that the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Softmax · Adam · Layer Normalization · Dense Connections · Multi-Head Attention · Label Smoothing
