AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages   with Adversarial Examples

Qianchu Liu; Edoardo M. Ponti; Diana McCarthy; Ivan Vuli\'c; Anna; Korhonen

arXiv:2104.08639·cs.CL·September 21, 2021

AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples

Qianchu Liu, Edoardo M. Ponti, Diana McCarthy, Ivan Vuli\'c, Anna, Korhonen

PDF

Open Access 1 Repo

TL;DR

AM2iCo is a comprehensive multilingual evaluation dataset designed to assess how well state-of-the-art models understand word meaning in context across diverse languages, highlighting significant gaps especially in low-resource and dissimilar languages.

Contribution

The paper introduces AM2iCo, a new evaluation set that overcomes limitations of existing datasets by covering more languages and providing adversarial examples for better model diagnostics.

Findings

01

Current models underperform humans in understanding word meaning in context.

02

Performance gaps are largest for low-resource and dissimilar languages.

03

AM2iCo reveals the need for improved multilingual and cross-lingual models.

Abstract

Capturing word meaning in context and distinguishing between correspondences and variations across languages is key to building successful multilingual and cross-lingual text representation models. However, existing multilingual evaluation datasets that evaluate lexical semantics "in-context" have various limitations. In particular, 1) their language coverage is restricted to high-resource languages and skewed in favor of only a few language families and areas, 2) a design that makes the task solvable via superficial cues, which results in artificially inflated (and sometimes super-human) performances of pretrained encoders, on many target languages, which limits their usefulness for model probing and diagnostics, and 3) little support for cross-lingual evaluation. In order to address these gaps, we present AM2iCo (Adversarial and Multilingual Meaning in Context), a wide-coverage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cambridgeltl/AM2iCo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification