A Bayesian Optimization Approach to Machine Translation Reranking

Julius Cheng; Maike Z\"ufle; Vil\'em Zouhar; Andreas Vlachos

arXiv:2411.09694·cs.CL·April 30, 2025

A Bayesian Optimization Approach to Machine Translation Reranking

Julius Cheng, Maike Z\"ufle, Vil\'em Zouhar, Andreas Vlachos

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a Bayesian optimization method for machine translation reranking that efficiently identifies top candidates with fewer scoring evaluations, reducing computational costs while maintaining output quality.

Contribution

It formulates reranking as a Bayesian optimization problem and incorporates multi-fidelity scoring with proxy models to improve efficiency.

Findings

01

Achieves the same quality with fewer scoring evaluations.

02

Uses multi-fidelity scoring with proxy models for better efficiency.

03

Reduces computational cost by up to 60% without quality loss.

Abstract

Reranking a list of candidates from a machine translation system with an external scoring model and returning the highest-scoring candidate remains a simple and effective method for improving the overall output quality. Translation scoring models continue to grow in size, with the best models being comparable to generation models. Thus, reranking can add substantial computational cost to the translation pipeline. In this work, we pose reranking as a Bayesian optimization (BayesOpt) problem. By strategically selecting candidates to score based on a balance of exploration and exploitation, we show that it is possible to find top-scoring candidates when scoring only a fraction of the candidate list. For instance, our method achieves the same CometKiwi score using only 70 scoring evaluations compared a baseline system using 180. We present a multi-fidelity setting for BayesOpt, where the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

juliusc/bayesopt_reranking
noneOfficial

Videos

A Bayesian Optimization Approach to Machine Translation Reranking· underline

Taxonomy

TopicsNatural Language Processing Techniques