Quantifying the Plausibility of Context Reliance in Neural Machine Translation

Gabriele Sarti; Grzegorz Chrupała; Malvina Nissim; Arianna Bisazza

arXiv:2310.01188·cs.CL·March 14, 2024·2 cites

TL;DR

This paper introduces PECoRe, a framework for evaluating how plausibly neural machine translation models rely on context, using interpretability techniques to compare model rationales with human annotations and identify context-driven predictions.

Contribution

The paper presents PECoRe, a novel interpretability framework that quantifies and analyzes context reliance in language models, addressing limitations of artificial benchmarks.

Findings

01. PECoRe effectively identifies context-sensitive tokens in translations.

02. The framework reveals instances of plausible and implausible context usage.

03. Comparison with human annotations validates the interpretability approach.

Abstract

Establishing whether language models can use contextual information in a human-plausible way is important to ensure their trustworthiness in real-world settings. However, the questions of when and which parts of the context affect model generations are typically tackled separately, with current plausibility evaluations being practically limited to a handful of artificial benchmarks. To address this, we introduce Plausibility Evaluation of Context Reliance (PECoRe), an end-to-end interpretability framework designed to quantify context usage in language models' generations. Our approach leverages model internals to (i) contrastively identify context-sensitive target tokens in generated texts and (ii) link them to contextual cues justifying their prediction. We use PECoRe to quantify the plausibility of context-aware machine translation models, comparing model rationales with human…
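
To make step (i) of the abstract concrete, the following is a minimal sketch of contrastive identification of context-sensitive target tokens. It is not the authors' implementation. It assumes that, for each target position, we already have the model's next-token distribution computed once with context and once without it, and it assumes KL divergence as the contrastive metric with a hand-picked threshold. The function names, the toy distributions, and the threshold value are all hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def context_sensitive_positions(ctx_dists, noctx_dists, threshold=0.5):
    """Flag target positions whose distribution shifts when context is added.

    ctx_dists / noctx_dists: one probability distribution per target position,
    obtained with and without the context (assumed to be precomputed).
    threshold: hypothetical cutoff; a real setup would need to calibrate it.
    """
    scores = [kl_divergence(p, q) for p, q in zip(ctx_dists, noctx_dists)]
    flagged = [i for i, score in enumerate(scores) if score > threshold]
    return flagged, scores


# Toy example: 3 target positions over a 3-token vocabulary.
# Only position 1 changes its distribution when context is available.
ctx_dists = [[0.7, 0.2, 0.1], [0.9, 0.05, 0.05], [0.1, 0.1, 0.8]]
noctx_dists = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]

flagged, scores = context_sensitive_positions(ctx_dists, noctx_dists)
print(flagged)
```

In this toy case only the middle position is flagged. Step (ii), linking each flagged token to the contextual cues that justify it, would require an attribution method over the context tokens and is not sketched here.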

Code & Models

Videos

Quantifying the Plausibility of Context Reliance in Neural Machine Translation · SlidesLive

Taxonomy

Topics: Topic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques