Global Entity Disambiguation with BERT

Ikuya Yamada; Koki Washio; Hiroyuki Shindo; Yuji Matsumoto

arXiv:1909.00426·cs.CL·May 3, 2022

Global Entity Disambiguation with BERT

Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto

PDF

Open Access 1 Repo

TL;DR

This paper introduces a global entity disambiguation model leveraging BERT, which considers both words and entities as input tokens, achieving state-of-the-art results across multiple datasets.

Contribution

It presents a novel BERT-based global ED model that sequentially resolves mentions using resolved entities as context, trained on a large Wikipedia-derived corpus.

Findings

01

Achieved state-of-the-art results on five ED datasets

02

Effectively models global context for entity disambiguation

03

Utilizes large-scale Wikipedia data for training

Abstract

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

studio-ousia/luke
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax