LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

Zhuoxuan Jiang; Haoyuan Peng; Shanshan Feng; Fan Li; Dongsheng Li

arXiv:2405.06705·cs.CL·June 4, 2025·1 cites

LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

Zhuoxuan Jiang, Haoyuan Peng, Shanshan Feng, Fan Li, Dongsheng Li

PDF

Open Access

TL;DR

This paper introduces Pedagogical Chain-of-Thought (PedCoT), a novel prompting strategy inspired by educational theory, to improve LLMs' ability to reliably identify mathematical reasoning mistakes, advancing self-correction methods.

Contribution

The paper presents PedCoT, a new prompting approach based on educational principles, that enhances LLMs' accuracy in detecting mathematical reasoning errors.

Findings

01

PedCoT significantly outperforms baseline prompts in mistake detection.

02

The approach enables reliable mathematical mistake identification in zero-shot settings.

03

Educational theory effectively guides prompt design for complex reasoning tasks.

Abstract

Self-correction is emerging as a promising approach to mitigate the issue of hallucination in Large Language Models (LLMs). To facilitate effective self-correction, recent research has proposed mistake detection as its initial step. However, current literature suggests that LLMs often struggle with reliably identifying reasoning mistakes when using simplistic prompting strategies. To address this challenge, we introduce a unique prompting strategy, termed the Pedagogical Chain-of-Thought (PedCoT), which is specifically designed to guide the identification of reasoning mistakes, particularly mathematical reasoning mistakes. PedCoT consists of pedagogical principles for prompts (PPP) design, two-stage interaction process (TIP) and grounded PedCoT prompts, all inspired by the educational theory of the Bloom Cognitive Model (BCM). We evaluate our approach on two public datasets featuring…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing

MethodsBLOOM