Modeling User Preferences with Automatic Metrics: Creating a   High-Quality Preference Dataset for Machine Translation

Sweta Agrawal; Jos\'e G. C. de Souza; Ricardo Rei; Ant\'onio Farinhas,; Gon\c{c}alo Faria; Patrick Fernandes; Nuno M Guerreiro; Andre Martins

arXiv:2410.07779·cs.CL·October 11, 2024

Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation

Sweta Agrawal, Jos\'e G. C. de Souza, Ricardo Rei, Ant\'onio Farinhas,, Gon\c{c}alo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins

PDF

Open Access 1 Video

TL;DR

This paper introduces MT-Pref, a large dataset of 18,000 translation preferences derived from human assessments and automatic metrics, to improve machine translation quality by aligning models with human-like preferences.

Contribution

It presents a novel method for creating a preference dataset by combining human judgments with automatic metrics, enhancing translation model training.

Findings

01

Aligning models on MT-Pref improves translation quality on benchmarks.

02

The dataset covers 18 languages and multiple domains.

03

Automatic metrics can effectively recover human preferences.

Abstract

Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the other hand, can induce preferences, but they might not match human expectations perfectly. In this paper, we propose an approach that leverages the best of both worlds. We first collect sentence-level quality assessments from professional linguists on translations generated by multiple high-quality MT systems and evaluate the ability of current automatic metrics to recover these preferences. We then use this analysis to curate a new dataset, MT-Pref (metric induced translation preference) dataset,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies