The paradox of the compositionality of natural language: a neural   machine translation case study

Verna Dankers; Elia Bruni; Dieuwke Hupkes

arXiv:2108.05885·cs.CL·April 1, 2022·1 cites

The paradox of the compositionality of natural language: a neural machine translation case study

Verna Dankers, Elia Bruni, Dieuwke Hupkes

PDF

Open Access 1 Repo

TL;DR

This paper investigates the compositional abilities of neural machine translation models, revealing complex behaviors and advocating for more realistic, data-driven benchmarks to assess compositionality in natural language processing.

Contribution

It re-instates and reformulates compositionality tests for NMT, providing empirical insights and calling for improved evaluation benchmarks based on real data.

Findings

01

Models trained on more data are more compositional.

02

Models sometimes less or more compositional than expected, showing varying levels of compositionality.

03

Some non-compositional behaviors are errors, others reflect natural data variation.

Abstract

Obtaining human-like performance in NLP is often argued to require compositional generalisation. Whether neural networks exhibit this ability is usually studied by training models on highly compositional synthetic data. However, compositionality in natural language is much more complex than the rigid, arithmetic-like version such data adheres to, and artificial compositionality tests thus do not allow us to determine how neural models deal with more realistic forms of compositionality. In this work, we re-instantiate three compositionality tests from the literature and reformulate them for neural machine translation (NMT). Our results highlight that: i) unfavourably, models trained on more data are more compositional; ii) models are sometimes less compositional than expected, but sometimes more, exemplifying that different levels of compositionality are required, and models are not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

i-machine-think/compositionality_paradox_mt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification