Embedding Self-Correction as an Inherent Ability in Large Language   Models for Enhanced Mathematical Reasoning

Kuofeng Gao; Huanqia Cai; Qingyao Shuai; Dihong Gong; Zhifeng Li

arXiv:2410.10735·cs.AI·February 11, 2025

Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning

Kuofeng Gao, Huanqia Cai, Qingyao Shuai, Dihong Gong, Zhifeng Li

PDF

Open Access

TL;DR

This paper introduces the Chain of Self-Correction (CoSC), a novel mechanism enabling large language models to iteratively validate and correct their mathematical reasoning, significantly improving accuracy without requiring demonstrations.

Contribution

The paper presents CoSC, a self-correction framework for LLMs that enhances mathematical reasoning by iterative validation and correction, trained with a two-phase fine-tuning approach.

Findings

01

CoSC improves performance on mathematical datasets.

02

CoSC-Code-34B achieves 53.5% on MATH dataset.

03

Operates effectively in zero-shot setting.

Abstract

Accurate mathematical reasoning with Large Language Models (LLMs) is crucial in revolutionizing domains that heavily rely on such reasoning. However, LLMs often encounter difficulties in certain aspects of mathematical reasoning, leading to flawed reasoning and erroneous results. To mitigate these issues, we introduce a novel mechanism, the Chain of Self-Correction (CoSC), specifically designed to embed self-correction as an inherent ability in LLMs, enabling them to validate and rectify their own results. The CoSC mechanism operates through a sequence of self-correction stages. In each stage, the LLMs generate a program to address a given problem, execute this program using program-based tools to obtain an output, subsequently verify this output. Based on the verification, the LLMs either proceed to the next correction stage or finalize the answer. This iterative self-correction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Online Learning and Analytics

MethodsAttention Is All You Need · Dropout · Layer Normalization · Adam · Dense Connections · Residual Connection · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings