ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification

Hyunseok Lee; Seunghyuk Oh; Jaehyung Kim; Jinwoo Shin; Jihoon Tack

arXiv:2502.14565·cs.LG·July 16, 2025

ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification

Hyunseok Lee, Seunghyuk Oh, Jaehyung Kim, Jinwoo Shin, Jihoon Tack

PDF

Open Access 1 Video

TL;DR

ReVISE introduces a novel framework enabling large language models to self-verify and correct their reasoning processes during inference, significantly enhancing their reasoning accuracy without relying on external verifiers.

Contribution

The paper presents ReVISE, a new self-verification framework for LLMs that uses curriculum learning and confidence-aware decoding to improve reasoning performance at test time.

Findings

01

ReVISE improves reasoning accuracy across multiple tasks.

02

Self-verification reduces errors in reasoning outputs.

03

Efficient training with preference learning enhances correction capabilities.

Abstract

Self-awareness, i.e., the ability to assess and correct one's own generation, is a fundamental aspect of human intelligence, making its replication in large language models (LLMs) an important yet challenging task. Previous works tackle this by employing extensive reinforcement learning or rather relying on large external verifiers. In this work, we propose Refine via Intrinsic Self-Verification (ReVISE), an efficient and effective framework that enables LLMs to self-correct their outputs through self-verification. The core idea of ReVISE is to enable LLMs to verify their reasoning processes and continually rethink reasoning trajectories based on its verification. We introduce a structured curriculum based upon online preference learning to implement this efficiently. Specifically, as ReVISE involves two challenging tasks (i.e., self-verification and reasoning correction), we tackle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification· slideslive

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning