CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute

Chen Jin; Ryutaro Tanno; Tom Diethe; Philip Teare

arXiv:2602.08948·cs.AI·February 10, 2026

CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute

Chen Jin, Ryutaro Tanno, Tom Diethe, Philip Teare

PDF

Open Access

TL;DR

CoRefine is a confidence-guided self-refinement method for large language models that reduces compute by selectively halting or re-examining based on confidence, achieving high accuracy with fewer tokens.

Contribution

It introduces a lightweight controller that guides self-refinement in LLMs, enabling efficient reasoning with significantly less compute and high confidence accuracy.

Findings

01

Achieves 92.6% confidence halting precision

02

Reduces token usage by roughly 190-fold compared to baseline

03

Enables scalable reasoning with a modular primitive

Abstract

Large Language Models (LLMs) often rely on test-time scaling via parallel decoding (for example, 512 samples) to boost reasoning accuracy, but this incurs substantial compute. We introduce CoRefine, a confidence-guided self-refinement method that achieves competitive accuracy using a fraction of the tokens via a lightweight 211k-parameter Conv1D controller atop a frozen LLM. The controller consumes full-trace confidence to decide whether to halt, re-examine, or try a different approach, enabling targeted self-correction with an average of 2.7 refinement steps per problem and roughly 190-fold token reduction relative to 512-sample baselines. Across diverse reasoning benchmarks and three open-source models, the controller achieves 92.6 percent precision when it confidently halts, indicating that confidence dynamics reliably signal correctness without ground-truth verification. We extend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms