Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms

Firas Trabelsi; David Vilar; Mara Finkelstein; Markus Freitag

arXiv:2406.02832·cs.CL·June 6, 2024

TL;DR

This paper introduces a low-rank matrix completion approach to approximate Minimum Bayes Risk decoding in machine translation, significantly reducing computation while maintaining translation quality.

Contribution

It formulates MBR decoding as a low-rank matrix completion problem and applies the Alternating Least Squares (ALS) algorithm to approximate the missing utility scores, reducing the number of utility computations by a factor of 16.
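
The formulation can be written out as follows. This is a sketch in assumed notation, not the paper's own: the symbols $N$, $M$, $u$, $k$, $\Omega$, and $\lambda$ are introduced here for illustration, and the regularization term is the common textbook form of the ALS objective rather than a detail confirmed by this summary.

```latex
% Utility matrix over N candidates h_i and N pseudo-references r_j
M_{ij} = u(h_i, r_j), \qquad M \in \mathbb{R}^{N \times N}

% MBR decision rule: pick the candidate with the highest average utility
h^{*} = \arg\max_{h_i} \frac{1}{N} \sum_{j=1}^{N} M_{ij}

% Low-rank assumption with rank k much smaller than N
M \approx X Y^{\top}, \qquad X, Y \in \mathbb{R}^{N \times k}

% Completion objective over the computed entries \Omega, with |\Omega| \approx N^2 / 16
\min_{X, Y} \sum_{(i,j) \in \Omega} \left( M_{ij} - x_i^{\top} y_j \right)^2
  + \lambda \left( \lVert X \rVert_F^2 + \lVert Y \rVert_F^2 \right)
```

With $Y$ held fixed, the objective is an independent least-squares problem for each row $x_i$, and the same holds for each $y_j$ with $X$ fixed, which is why alternating between the two is cheap.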

Findings

01. Achieves similar translation quality with 1/16th of the utility computations.
02. Empirically confirms the low-rank structure of score matrices.
03. Outperforms other approximation methods in quality benchmarks.
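
One way to check the low-rank claim on a given score matrix is to look at how quickly its singular values decay. The sketch below is an illustration under assumptions, not the paper's procedure: the helper `effective_rank`, the 99% energy threshold, and the synthetic matrix standing in for real utility scores are all hypothetical.

```python
import numpy as np

def effective_rank(matrix, energy=0.99):
    """Smallest k whose top-k singular values hold `energy` of the squared spectrum."""
    singular_values = np.linalg.svd(matrix, compute_uv=False)
    cumulative = np.cumsum(singular_values ** 2) / np.sum(singular_values ** 2)
    # First index at which the cumulative share reaches the threshold, as a count.
    return int(np.searchsorted(cumulative, energy) + 1)

rng = np.random.default_rng(0)
n, true_rank = 64, 3
# Synthetic stand-in for a utility matrix: a rank-3 signal plus small noise.
scores = rng.normal(size=(n, true_rank)) @ rng.normal(size=(true_rank, n))
scores += 0.01 * rng.normal(size=(n, n))

print("effective rank:", effective_rank(scores))
```

An effective rank far below the matrix dimension suggests that a small number of factors explains most of the scores, which is the property matrix completion relies on.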

Abstract

Minimum Bayes Risk (MBR) decoding is a powerful decoding strategy widely used for text generation tasks, but its quadratic computational complexity limits its practical application. This paper presents a novel approach for approximating MBR decoding using matrix completion techniques, focusing on the task of machine translation. We formulate MBR decoding as a matrix completion problem, where the utility metric scores between candidate hypotheses and pseudo-reference translations form a low-rank matrix. First, we empirically show that the score matrices indeed have a low-rank structure. Then, we exploit this by computing only a random subset of the scores and efficiently recovering the missing entries in the matrix with the Alternating Least Squares (ALS) algorithm, thereby enabling a fast approximation of the MBR decoding process. Our experimental results on machine translation…
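
The pipeline described in the abstract (compute a random subset of scores, complete the matrix with ALS, then pick the best candidate) can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `als_complete` and `mbr_select`, the rank, the regularization strength, the iteration count, and the synthetic score matrix are all hypothetical choices.

```python
import numpy as np

def als_complete(observed, mask, rank=4, reg=0.1, iters=20, seed=0):
    """Fill in a partially observed matrix with regularized ALS.

    observed: (n, m) array; values are only read where mask is True.
    mask:     (n, m) boolean array marking the entries that were computed.
    """
    rng = np.random.default_rng(seed)
    n, m = observed.shape
    X = rng.normal(scale=0.1, size=(n, rank))  # row (candidate) factors
    Y = rng.normal(scale=0.1, size=(m, rank))  # column (pseudo-reference) factors
    ridge = reg * np.eye(rank)
    for _ in range(iters):
        # Fix Y and solve a small ridge regression for each row factor.
        for i in range(n):
            cols = mask[i]
            Yi = Y[cols]
            X[i] = np.linalg.solve(Yi.T @ Yi + ridge, Yi.T @ observed[i, cols])
        # Fix X and solve the same kind of problem for each column factor.
        for j in range(m):
            rows = mask[:, j]
            Xj = X[rows]
            Y[j] = np.linalg.solve(Xj.T @ Xj + ridge, Xj.T @ observed[rows, j])
    return X @ Y.T

def mbr_select(scores):
    """Index of the candidate (row) with the highest average utility."""
    return int(np.argmax(scores.mean(axis=1)))

# Synthetic stand-in for a utility matrix (assumption: exactly rank 2).
rng = np.random.default_rng(1)
n, true_rank = 128, 2
truth = rng.uniform(0.5, 1.0, (n, true_rank)) @ rng.uniform(0.5, 1.0, (n, true_rank)).T

# Pretend only about 1/16 of the utility scores were actually computed.
mask = rng.random((n, n)) < 1 / 16
approx = als_complete(np.where(mask, truth, 0.0), mask, rank=2)

print("full MBR pick:  ", mbr_select(truth))
print("approx MBR pick:", mbr_select(approx))
```

Each inner solve involves only a `rank` by `rank` system, so the completion step is cheap compared with calling a neural utility metric for every pair. In practice the observed entries would come from the real metric rather than from a synthetic matrix.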

Code & Models

Repositories

naist-nlp/mbrs
pytorch

Videos

Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms · SlidesLive

Taxonomy

Topics: Face and Expression Recognition · Blind Source Separation Techniques · Neural Networks and Applications