One-Token Verification for Reasoning Correctness Estimation
Zhan Zhuang, Xiequn Wang, Zebin Chen, Feiyang Ye, Ying Wei, Kede Ma, Yu Zhang

TL;DR
This paper introduces One-Token Verification, a method enabling LLMs to estimate reasoning correctness during generation with a single forward pass, improving efficiency and reliability in complex reasoning tasks.
Contribution
The paper presents a novel, learnable token-based verification method integrated into LLMs that accurately assesses reasoning correctness and reduces inference costs.
Findings
OTV outperforms existing verifiers on mathematical reasoning benchmarks.
OTV enables up to 90% reduction in token usage through early stopping.
OTV supports token-level correctness estimation without disrupting primary reasoning.
Abstract
Recent breakthroughs in large language models (LLMs) have led to notable successes in complex reasoning tasks, such as mathematical problem solving. A common strategy for improving performance is parallel thinking, in which multiple reasoning traces are generated and the final prediction is made using aggregation schemes like majority voting or best-of- decoding. However, two key challenges persist. First, multi-sample decoding incurs substantial inference latency, especially for long-form outputs. Second, effective mechanisms for reliably assessing the correctness of individual reasoning traces are still limited. To address these challenges, we introduce One-Token Verification (OTV), a computational method that estimates reasoning correctness in a single forward pass during generation. OTV is activated by a learnable token and integrated into the LLM via low-rank adaptation to probe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques
