Enabling arbitrary translation objectives with Adaptive Tree Search
Wang Ling, Wojciech Stokowiec, Domenic Donato, Laurent Sartran, Lei, Yu, Austin Matthews, Chris Dyer

TL;DR
This paper presents an adaptive tree search algorithm for translation models that can optimize arbitrary objectives, outperforming traditional methods like beam search and reranking, and offers new insights into decoding biases and model calibration.
Contribution
The paper introduces a deterministic adaptive tree search algorithm that enables optimization of arbitrary translation objectives without structural assumptions, surpassing beam search and reranking methods.
Findings
Adaptive tree search finds higher-scoring outputs than beam search.
The algorithm improves results for models with non-additive scoring functions.
Some models benefit from increased search, revealing limitations of beam search bias.
Abstract
We introduce an adaptive tree search algorithm, that can find high-scoring outputs under translation models that make no assumptions about the form or structure of the search objective. This algorithm -- a deterministic variant of Monte Carlo tree search -- enables the exploration of new kinds of models that are unencumbered by constraints imposed to make decoding tractable, such as autoregressivity or conditional independence assumptions. When applied to autoregressive models, our algorithm has different biases than beam search has, which enables a new analysis of the role of decoding bias in autoregressive models. Empirically, we show that our adaptive tree search algorithm finds outputs with substantially better model scores compared to beam search in autoregressive models, and compared to reranking techniques in models whose scores do not decompose additively with respect to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Machine Learning and Algorithms
