One-Token Verification for Reasoning Correctness Estimation

Zhan Zhuang; Xiequn Wang; Zebin Chen; Feiyang Ye; Ying Wei; Kede Ma; Yu Zhang

arXiv:2603.01025·cs.LG·March 3, 2026

One-Token Verification for Reasoning Correctness Estimation

Zhan Zhuang, Xiequn Wang, Zebin Chen, Feiyang Ye, Ying Wei, Kede Ma, Yu Zhang

PDF

Open Access

TL;DR

This paper introduces One-Token Verification, a method enabling LLMs to estimate reasoning correctness during generation with a single forward pass, improving efficiency and reliability in complex reasoning tasks.

Contribution

The paper presents a novel, learnable token-based verification method integrated into LLMs that accurately assesses reasoning correctness and reduces inference costs.

Findings

01

OTV outperforms existing verifiers on mathematical reasoning benchmarks.

02

OTV enables up to 90% reduction in token usage through early stopping.

03

OTV supports token-level correctness estimation without disrupting primary reasoning.

Abstract

Recent breakthroughs in large language models (LLMs) have led to notable successes in complex reasoning tasks, such as mathematical problem solving. A common strategy for improving performance is parallel thinking, in which multiple reasoning traces are generated and the final prediction is made using aggregation schemes like majority voting or best-of- $N$ decoding. However, two key challenges persist. First, multi-sample decoding incurs substantial inference latency, especially for long-form outputs. Second, effective mechanisms for reliably assessing the correctness of individual reasoning traces are still limited. To address these challenges, we introduce One-Token Verification (OTV), a computational method that estimates reasoning correctness in a single forward pass during generation. OTV is activated by a learnable token and integrated into the LLM via low-rank adaptation to probe…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques