Inference-Time Language Model Alignment via Integrated Value Guidance
Zhixuan Liu, Zhanhui Zhou, Yuanfu Wang, Chao Yang, Yu Qiao

TL;DR
This paper introduces Integrated Value Guidance (IVG), a novel inference-time method that aligns large language models with human preferences using implicit and explicit value functions, avoiding costly fine-tuning.
Contribution
The paper presents IVG, a new inference-time alignment technique that leverages value functions at token and chunk levels, outperforming traditional fine-tuning methods.
Findings
IVG improves sentiment and summarization task alignment.
IVG enhances instruction-following benchmark performance.
Significant increase in win rates against GPT-4-Turbo with IVG.
Abstract
Large language models are typically fine-tuned to align with human preferences, but tuning large models is computationally intensive and complex. In this work, we introduce (IVG), a method that uses implicit and explicit value functions to guide language model decoding at token and chunk-level respectively, efficiently aligning large language models purely at inference time. This approach circumvents the complexities of direct fine-tuning and outperforms traditional methods. Empirically, we demonstrate the versatility of IVG across various tasks. In controlled sentiment generation and summarization tasks, our method significantly improves the alignment of large models using inference-time guidance from -based value functions. Moreover, in a more challenging instruction-following benchmark AlpacaEval 2.0, we show that both specifically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsALIGN
