Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
Fei Yu, Yingru Li, Benyou Wang

TL;DR
This paper introduces an uncertainty-aware framework for language model reasoning that improves robustness by using value distributions and a probabilistic candidate selection method, significantly reducing verifier failures especially on out-of-distribution problems.
Contribution
It presents the first systematic integration of uncertainty quantification into LLM search methods, replacing point estimates with value distributions and employing Group Thompson Sampling.
Findings
Reduces verifier failure on OOD problems
Increases solution coverage in reasoning tasks
Enhances robustness of value model guided search
Abstract
Value model guided search is effective in steering LLM generation but suffers from a lack of robustness. This is due to verifier failure: imperfect VMs mistakenly prune valid reasoning paths, especially when encountering unseen reasoning paths generated during search. To address this, we propose an uncertainty-aware framework with two key components: (1) Uncertainty-Aware Value Models (UVMs), which replace single-point value estimates with value distributions to quantify prediction reliability, and (2) Group Thompson Sampling, an efficient algorithm that selects candidates based on their probability of being optimal. Experiments on two In-Distribution (ID) settings (GSM8K, MATH) and three Out-Of-Distribution (OOD) settings (e.g., AIME25, Minerva Math) show our method significantly mitigates verifier failure and boosts solution coverage, especially on OOD problems. This work provides the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Advanced Database Systems and Queries · Service-Oriented Architecture and Web Services
