Evaluating and Improving the Coreference Capabilities of Machine Translation Models

Asaf Yehudai; Arie Cattan; Omri Abend; Gabriel Stanovsky

arXiv:2302.08464 · cs.CL · February 17, 2023

Open Access

TL;DR

This paper assesses how well machine translation models implicitly learn coreference resolution, develops an evaluation method for this, and explores ways to improve MT by integrating coreference information.

Contribution

The paper introduces an evaluation methodology that derives coreference clusters from MT output without requiring target-language annotations, and shows how incorporating coreference resolution can improve translation quality.

Findings

01. MT models underperform dedicated coreference resolvers.

02. Incorporating coreference information improves translation quality.

03. Monolingual coreference resolvers outperform MT models on coreference tasks.

Abstract

Machine translation (MT) requires a wide range of linguistic capabilities, which current end-to-end models are expected to learn implicitly by observing aligned sentences in bilingual corpora. In this work, we ask: How well do MT models learn coreference resolution from implicit signal? To answer this question, we develop an evaluation methodology that derives coreference clusters from MT output and evaluates them without requiring annotations in the target language. We further evaluate several prominent open-source and commercial MT systems, translating from English to six target languages, and compare them to state-of-the-art coreference resolvers on three challenging benchmarks. Our results show that the monolingual resolvers greatly outperform MT models. Motivated by this result, we experiment with different methods for incorporating the output of coreference resolution…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification