Soft Best-of-n Sampling for Model Alignment
Claudio Mayrink Verdun, Alex Oesterling, Himabindu Lakkaraju, Flavio, P. Calmon

TL;DR
This paper introduces Soft Best-of-n sampling, a method that smoothly interpolates between original and reward-optimized distributions, with theoretical guarantees on convergence and insights into sampling limitations.
Contribution
It proposes Soft Best-of-n sampling, extending traditional BoN sampling with a temperature parameter, and provides theoretical analysis of its convergence and fundamental limitations.
Findings
Soft Best-of-n converges to the optimal tilted distribution at O(1/n) rate.
The method allows smooth control over reward and distribution distortion.
Analysis reveals fundamental limitations of blockwise sampling for discrete sequences.
Abstract
Best-of- (BoN) sampling is a practical approach for aligning language model outputs with human preferences without expensive fine-tuning. BoN sampling is performed by generating responses to a prompt and then selecting the sample that maximizes a reward function. BoN yields high reward values in practice at a distortion cost, as measured by the KL-divergence between the sampled and original distribution. This distortion is coarsely controlled by varying the number of samples: larger yields a higher reward at a higher distortion cost. We introduce Soft Best-of- sampling, a generalization of BoN that allows for smooth interpolation between the original distribution and reward-maximizing distribution through a temperature parameter . We establish theoretical guarantees showing that Soft Best-of- sampling converges sharply to the optimal tilted distribution at a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Reservoir Engineering and Simulation Methods
