SALTED: A Framework for SAlient Long-Tail Translation Error Detection

Vikas Raunak; Matt Post; Arul Menezes

arXiv:2205.09988·cs.CL·May 23, 2022·6 cites

SALTED: A Framework for SAlient Long-Tail Translation Error Detection

Vikas Raunak, Matt Post, Arul Menezes

PDF

Open Access

TL;DR

SALTED is a framework that detects rare but critical long-tail errors in machine translation, improving transparency and enabling targeted model improvements.

Contribution

It introduces a specifications-based testing framework with high-precision error detectors for identifying long-tail translation errors.

Findings

01

Detectors effectively identify salient long-tail errors.

02

Framework enables filtering, fixing, and testing of MT models.

03

Improves visibility into rare translation failures.

Abstract

Traditional machine translation (MT) metrics provide an average measure of translation quality that is insensitive to the long tail of behavioral problems in MT. Examples include translation of numbers, physical units, dropped content and hallucinations. These errors, which occur rarely and unpredictably in Neural Machine Translation (NMT), greatly undermine the reliability of state-of-the-art MT systems. Consequently, it is important to have visibility into these problems during model development. Towards this direction, we introduce SALTED, a specifications-based framework for behavioral testing of MT models that provides fine-grained views of salient long-tail errors, permitting trustworthy visibility into previously invisible problems. At the core of our approach is the development of high-precision detectors that flag errors (or alternatively, verify output correctness) between a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Adversarial Robustness in Machine Learning