Quantifying the Plausibility of Context Reliance in Neural Machine Translation
Gabriele Sarti, Grzegorz Chrupa{\l}a, Malvina Nissim, Arianna Bisazza

TL;DR
This paper introduces PECoRe, a framework for evaluating how plausibly neural machine translation models rely on context, using interpretability techniques to compare model rationales with human annotations and identify context-driven predictions.
Contribution
The paper presents PECoRe, a novel interpretability framework that quantifies and analyzes context reliance in language models, addressing limitations of artificial benchmarks.
Findings
PECoRe effectively identifies context-sensitive tokens in translations.
The framework reveals instances of plausible and implausible context usage.
Comparison with human annotations validates the interpretability approach.
Abstract
Establishing whether language models can use contextual information in a human-plausible way is important to ensure their trustworthiness in real-world settings. However, the questions of when and which parts of the context affect model generations are typically tackled separately, with current plausibility evaluations being practically limited to a handful of artificial benchmarks. To address this, we introduce Plausibility Evaluation of Context Reliance (PECoRe), an end-to-end interpretability framework designed to quantify context usage in language models' generations. Our approach leverages model internals to (i) contrastively identify context-sensitive target tokens in generated texts and (ii) link them to contextual cues justifying their prediction. We use \pecore to quantify the plausibility of context-aware machine translation models, comparing model rationales with human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗context-mt/scat-marian-big-ctx4-cwd1-en-frmodel· 2 dl2 dl
- 🤗context-mt/scat-mbart50-1toM-ctx4-cwd1-en-frmodel· 1 dl1 dl
- 🤗context-mt/scat-marian-small-ctx4-cwd1-en-frmodel· 132 dl132 dl
- 🤗context-mt/scat-mbart50-1toM-target-ctx4-cwd0-en-frmodel· 4 dl4 dl
- 🤗context-mt/scat-marian-big-target-ctx4-cwd0-en-frmodel· 1 dl1 dl
- 🤗context-mt/scat-marian-small-target-ctx4-cwd0-en-frmodel· 122 dl· ♡ 1122 dl♡ 1
Videos
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques
