AdaBoN: Adaptive Best-of-N Alignment
Vinod Raman, Hilal Asi, Satyen Kale

TL;DR
AdaBoN introduces an adaptive, prompt-specific Best-of-N alignment method that improves efficiency and performance in aligning language models with reward models, especially for larger batches.
Contribution
It proposes a novel two-stage, adaptive algorithm for Best-of-N alignment that dynamically allocates inference resources based on prompt difficulty, enhancing efficiency and effectiveness.
Findings
Outperforms uniform allocation with the same inference budget.
Remains competitive with larger inference budgets.
Improves as batch size increases.
Abstract
Recent advances in test-time alignment methods, such as Best-of-N sampling, offer a simple and effective way to steer language models (LMs) toward preferred behaviors using reward models (RM). However, these approaches can be computationally expensive, especially when applied uniformly across prompts without accounting for differences in alignment difficulty. In this work, we propose a prompt-adaptive strategy for Best-of-N alignment that allocates inference-time compute more efficiently. Motivated by latency concerns, we develop a two-stage algorithm: an initial exploratory phase estimates the reward distribution for each prompt using a small exploration budget, and a second stage adaptively allocates the remaining budget using these estimates. Our method is simple, practical, and compatible with any LM-RM combination. Empirical results on prompts from the AlpacaEval, HH-RLHF, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques
