Multilingual Event Linking to Wikidata

Adithya Pratapa; Rishubh Gupta; Teruko Mitamura

arXiv:2204.06535·cs.CL·July 19, 2022

Multilingual Event Linking to Wikidata

Adithya Pratapa, Rishubh Gupta, Teruko Mitamura

PDF

Open Access 1 Repo 2 Datasets

TL;DR

This paper introduces a large-scale multilingual event linking task to Wikidata, compares multiple models including BM25+ and neural architectures, and evaluates their performance across language and domain variations.

Contribution

It presents a new dataset for multilingual event linking, proposes two task variants, and demonstrates the effectiveness of neural models over traditional methods.

Findings

01

Neural models outperform BM25+ baseline significantly.

02

Crosslingual task is more challenging than multilingual.

03

Models show good generalization on out-of-domain data.

Abstract

We present a task of multilingual linking of events to a knowledge base. We automatically compile a large-scale dataset for this task, comprising of 1.8M mentions across 44 languages referring to over 10.9K events from Wikidata. We propose two variants of the event linking task: 1) multilingual, where event descriptions are from the same language as the mention, and 2) crosslingual, where all event descriptions are in English. On the two proposed tasks, we compare multiple event linking systems including BM25+ (Lv and Zhai, 2011) and multilingual adaptations of the biencoder and crossencoder architectures from BLINK (Wu et al., 2020). In our experiments on the two task variants, we find both biencoder and crossencoder models significantly outperform the BM25+ baseline. Our results also indicate that the crosslingual task is in general more challenging than the multilingual task. To test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adithya7/xlel-wd
pytorchOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies