OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection

Chenyang Huang; Abbas Ghaddar; Ivan Kobyzev; Mehdi Rezagholizadeh; Osmar R. Zaiane; Boxing Chen

arXiv:2406.01919 · cs.CL · June 5, 2024

TL;DR

OTTAWA is a novel OT-based word aligner that improves hallucination and omission detection in machine translation, achieving competitive results without needing internal MT system states.

Contribution

Introduces OTTAWA, an optimal transport-based word aligner that adds a "null" vector and a one-side constrained OT setting for adaptive null alignment, yielding results competitive with state-of-the-art methods across 18 language pairs on the HalOmi benchmark.

Findings

01 Competitive results on the HalOmi benchmark.

02 Effective at distinguishing hallucinations from omissions.

03 Performs word-level detection without internal MT states.

Abstract

Recently, there has been considerable attention on detecting hallucinations and omissions in Machine Translation (MT) systems. The two dominant approaches to tackle this task involve analyzing the MT system's internal states or relying on the output of external tools, such as sentence similarity or MT quality estimators. In this work, we introduce OTTAWA, a novel Optimal Transport (OT)-based word aligner specifically designed to enhance the detection of hallucinations and omissions in MT systems. Our approach explicitly models the missing alignments by introducing a "null" vector, for which we propose a novel one-side constrained OT setting to allow an adaptive null alignment. Our approach yields competitive results compared to state-of-the-art methods across 18 language pairs on the HalOmi benchmark. In addition, it shows promising features, such as the ability to distinguish between…
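The idea of a null vector under a one-side constraint can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine-distance cost, the fixed `null_cost`, the entropic temperature `eps`, the `threshold`, and all function names are assumptions made for illustration. The sketch uses the fact that entropic OT with only the source-side marginal enforced has a closed-form solution, a row-wise softmax over negative costs, so each source word is free to send its mass to the null column when no target word is close enough.

```python
import math


def cosine_distance(u, v):
    """1 - cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def one_sided_ot_with_null(src_vecs, tgt_vecs, null_cost=0.5, eps=0.05):
    """Entropic OT with only the source (row) marginal constrained.

    Each source word carries mass 1/n. The target side, extended with one
    "null" column of fixed cost (an assumed hyperparameter), is unconstrained,
    so the minimiser is a softmax over each row of negative costs.
    """
    n = len(src_vecs)
    plan = []
    for s in src_vecs:
        costs = [cosine_distance(s, t) for t in tgt_vecs] + [null_cost]
        c_min = min(costs)  # shift costs for numerical stability
        weights = [math.exp(-(c - c_min) / eps) for c in costs]
        total = sum(weights)
        plan.append([(1.0 / n) * w / total for w in weights])
    return plan


def flag_unaligned(plan, threshold=0.5):
    """Indices of source words that send most of their mass to null."""
    return [i for i, row in enumerate(plan) if row[-1] / sum(row) > threshold]


# Toy 2-D "embeddings": the third source word has no close target word.
src = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
tgt = [[1.0, 0.1], [0.1, 1.0]]

plan = one_sided_ot_with_null(src, tgt)
print(flag_unaligned(plan))  # [2]
```

In this toy reading, a source word aligned to null suggests an omission candidate; running the same routine with the roles of source and target swapped would flag target words with no source support, which are hallucination candidates.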

Code & Models

Repositories

chenyangh/OTTAWA (PyTorch, official)

Videos

OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection · Underline
