Inference-Time Language Model Alignment via Integrated Value Guidance

Zhixuan Liu; Zhanhui Zhou; Yuanfu Wang; Chao Yang; Yu Qiao

arXiv:2409.17819·cs.CL·September 27, 2024

Inference-Time Language Model Alignment via Integrated Value Guidance

Zhixuan Liu, Zhanhui Zhou, Yuanfu Wang, Chao Yang, Yu Qiao

PDF

Open Access

TL;DR

This paper introduces Integrated Value Guidance (IVG), a novel inference-time method that aligns large language models with human preferences using implicit and explicit value functions, avoiding costly fine-tuning.

Contribution

The paper presents IVG, a new inference-time alignment technique that leverages value functions at token and chunk levels, outperforming traditional fine-tuning methods.

Findings

01

IVG improves sentiment and summarization task alignment.

02

IVG enhances instruction-following benchmark performance.

03

Significant increase in win rates against GPT-4-Turbo with IVG.

Abstract

Large language models are typically fine-tuned to align with human preferences, but tuning large models is computationally intensive and complex. In this work, we introduce $Integrated Value Guidance$ (IVG), a method that uses implicit and explicit value functions to guide language model decoding at token and chunk-level respectively, efficiently aligning large language models purely at inference time. This approach circumvents the complexities of direct fine-tuning and outperforms traditional methods. Empirically, we demonstrate the versatility of IVG across various tasks. In controlled sentiment generation and summarization tasks, our method significantly improves the alignment of large models using inference-time guidance from $gpt2$ -based value functions. Moreover, in a more challenging instruction-following benchmark AlpacaEval 2.0, we show that both specifically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies

MethodsALIGN