Boosting LLM Reasoning via Spontaneous Self-Correction

Xutong Zhao; Tengyu Xu; Xuewei Wang; Zhengxing Chen; Di Jin; Liang Tan; Yen-Ting; Zishun Yu; Zhuokai Zhao; Yun He; Sinong Wang; Han Fang; Sarath Chandar; Chen Zhu

arXiv:2506.06923·cs.AI·June 10, 2025

Boosting LLM Reasoning via Spontaneous Self-Correction

Xutong Zhao, Tengyu Xu, Xuewei Wang, Zhengxing Chen, Di Jin, Liang Tan, Yen-Ting, Zishun Yu, Zhuokai Zhao, Yun He, Sinong Wang, Han Fang, Sarath Chandar, Chen Zhu

PDF

Open Access

TL;DR

This paper introduces SPOC, a spontaneous self-correction method for large language models that interleaves solution generation and verification in a single pass, significantly improving math reasoning accuracy.

Contribution

SPOC enables real-time, interleaved solution and verification in LLMs through a multi-agent approach and reinforcement learning, advancing beyond previous post-hoc correction methods.

Findings

01

SPOC improves Llama-3.1-8B and 70B models' accuracy on math benchmarks.

02

Achieves up to 20% accuracy gains on AMC23.

03

Demonstrates effective real-time self-correction in LLMs.

Abstract

While large language models (LLMs) have demonstrated remarkable success on a broad range of tasks, math reasoning remains a challenging one. One of the approaches for improving math reasoning is self-correction, which designs self-improving loops to let the model correct its own mistakes. However, existing self-correction approaches treat corrections as standalone post-generation refinements, relying on extra prompt and system designs to elicit self-corrections, instead of performing real-time, spontaneous self-corrections in a single pass. To address this, we propose SPOC, a spontaneous self-correction approach that enables LLMs to generate interleaved solutions and verifications in a single inference pass, with generation dynamically terminated based on verification outcomes, thereby effectively scaling inference time compute. SPOC considers a multi-agent perspective by assigning dual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications