Controlled Decoding from Language Models

Sidharth Mudgal; Jong Lee; Harish Ganapathy; YaGuang Li and; Tao Wang; Yanping Huang; Zhifeng Chen; Heng-Tze Cheng; Michael; Collins; Trevor Strohman; Jilin Chen; Alex Beutel; Ahmad Beirami

arXiv:2310.17022·cs.LG·June 5, 2024·2 cites

Controlled Decoding from Language Models

Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li and, Tao Wang, Yanping Huang, Zhifeng Chen, Heng-Tze Cheng, Michael, Collins, Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami

PDF

Open Access

TL;DR

Controlled decoding (CD) is a modular, tokenwise RL-based method that uses a prefix scorer to steer language model outputs towards desired rewards, enabling effective control, multi-objective handling, and transferability without retraining.

Contribution

Introduces a novel modular controlled decoding approach that employs a prefix scorer for tokenwise RL, allowing flexible, multi-objective, and transferable control of language models without additional training.

Findings

01

Effective control on benchmark tasks

02

Combines multiple rewards at inference time

03

Transfers control to unseen models without tuning

Abstract

KL-regularized reinforcement learning (RL) is a popular alignment framework to control the language model responses towards high reward outcomes. We pose a tokenwise RL objective and propose a modular solver for it, called controlled decoding (CD). CD exerts control through a separate prefix scorer module, which is trained to learn a value function for the reward. The prefix scorer is used at inference time to control the generation from a frozen base model, provably sampling from a solution to the RL objective. We empirically demonstrate that CD is effective as a control mechanism on popular benchmarks. We also show that prefix scorers for multiple rewards may be combined at inference time, effectively solving a multi-objective RL problem with no additional training. We show that the benefits of applying CD transfer to an unseen base model with no further tuning as well. Finally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsBalanced Selection