AdaBoN: Adaptive Best-of-N Alignment

Vinod Raman; Hilal Asi; Satyen Kale

arXiv:2505.12050·cs.CL·March 16, 2026

Open Access

TL;DR

AdaBoN is a prompt-adaptive Best-of-N alignment method that allocates inference-time compute per prompt rather than uniformly. It outperforms uniform allocation at the same inference budget, and its performance improves as the batch of prompts grows.

Contribution

The paper proposes a two-stage adaptive algorithm for Best-of-N alignment: it first estimates each prompt's reward distribution with a small exploration budget, then allocates the remaining inference budget across prompts according to those estimates.

Findings

01 Outperforms uniform allocation at the same inference budget.

02 Remains competitive with uniform allocation given a larger inference budget.

03 Improves in performance as the batch size increases.

Abstract

Recent advances in test-time alignment methods, such as Best-of-N sampling, offer a simple and effective way to steer language models (LMs) toward preferred behaviors using reward models (RMs). However, these approaches can be computationally expensive, especially when applied uniformly across prompts without accounting for differences in alignment difficulty. In this work, we propose a prompt-adaptive strategy for Best-of-N alignment that allocates inference-time compute more efficiently. Motivated by latency concerns, we develop a two-stage algorithm: an initial exploratory phase estimates the reward distribution for each prompt using a small exploration budget, and a second stage adaptively allocates the remaining budget using these estimates. Our method is simple, practical, and compatible with any LM-RM combination. Empirical results on prompts from the AlpacaEval, HH-RLHF, and…
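
The two-stage structure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not spell out the allocation rule, so the rule used here (splitting the remaining budget in proportion to the sample standard deviation of each prompt's exploration rewards) is an illustrative assumption, not the authors' method. The names `generate`, `reward`, `allocate_proportional`, and `adaptive_best_of_n` are likewise hypothetical stand-ins for an LM sampler, an RM scorer, and helper functions.

```python
import math
import statistics
from typing import Callable, List, Sequence, Tuple


def allocate_proportional(weights: Sequence[float], budget: int) -> List[int]:
    """Split an integer budget in proportion to weights (largest-remainder rounding)."""
    total = sum(weights)
    if total == 0:
        # No signal from exploration: fall back to a uniform split.
        weights = [1.0] * len(weights)
        total = float(len(weights))
    raw = [budget * w / total for w in weights]
    alloc = [math.floor(x) for x in raw]
    leftover = budget - sum(alloc)
    # Hand out the remaining units to the largest fractional parts.
    order = sorted(range(len(raw)), key=lambda i: raw[i] - alloc[i], reverse=True)
    for i in order[:leftover]:
        alloc[i] += 1
    return alloc


def adaptive_best_of_n(
    prompts: Sequence[str],
    generate: Callable[[str], str],       # hypothetical LM sampler
    reward: Callable[[str, str], float],  # hypothetical RM scorer
    total_budget: int,
    explore_per_prompt: int = 4,
) -> List[str]:
    """Return the highest-reward response per prompt under a shared sample budget."""
    if explore_per_prompt < 2:
        raise ValueError("need at least 2 exploration samples to estimate spread")
    explore_cost = explore_per_prompt * len(prompts)
    if total_budget < explore_cost:
        raise ValueError("total_budget is smaller than the exploration cost")

    # Stage 1: spend a small, equal exploration budget on every prompt.
    scored: List[List[Tuple[float, str]]] = []
    for p in prompts:
        candidates = [generate(p) for _ in range(explore_per_prompt)]
        scored.append([(reward(p, c), c) for c in candidates])

    # Stage 2: allocate the remaining budget using the exploration estimates.
    # ASSUMPTION: the weight is the reward standard deviation, chosen for illustration.
    spreads = [statistics.stdev([r for r, _ in s]) for s in scored]
    extra = allocate_proportional(spreads, total_budget - explore_cost)
    for p, s, n in zip(prompts, scored, extra):
        for _ in range(n):
            c = generate(p)
            s.append((reward(p, c), c))

    # Best-of-N selection over all samples drawn for each prompt.
    return [max(s, key=lambda rc: rc[0])[1] for s in scored]
```

Under this assumed rule, prompts whose exploration rewards vary widely receive more of the second-stage samples, while prompts with nearly constant rewards receive few or none. Both stages can issue their generations as a batch, which is in keeping with the latency motivation mentioned in the abstract.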

Taxonomy

Topics: Parallel Computing and Optimization Techniques