Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models
Che Zhang, Zhenyang Xiao, Chengcheng Han, Yixin Lian and, Yuejian Fang

TL;DR
This paper introduces a new training format called 'Step CoT Check' to improve large language models' ability to self-detect and correct reasoning errors, significantly enhancing their self-checking performance across benchmarks.
Contribution
It proposes a novel 'Step CoT Check' format and dataset for training LLMs, leading to improved self-correction abilities, especially in reasoning tasks.
Findings
Enhanced error detection and correction in LLMs.
Significant performance gains on multiple benchmarks.
Greater benefits observed in larger models.
Abstract
Self-correction has achieved impressive results in enhancing the style and security of the generated output from large language models (LLMs). However, recent studies suggest that self-correction might be limited or even counterproductive in reasoning tasks due to LLMs' difficulties in identifying logical mistakes. In this paper, we aim to enhance the self-checking capabilities of LLMs by constructing training data for checking tasks. Specifically, we apply the Chain of Thought (CoT) methodology to self-checking tasks, utilizing fine-grained step-level analyses and explanations to assess the correctness of reasoning paths. We propose a specialized checking format called "Step CoT Check". Following this format, we construct a checking-correction dataset that includes detailed step-by-step analysis and checking. Then we fine-tune LLMs to enhance their error detection and correction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
