Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Erxin Yu; Jing Li; Ming Liao; Qi Zhu; Boyang Xue; Minghui Xu; Baojun Wang; Lanqing Hong; Fei Mi; Lifeng Shang

arXiv:2505.22591·cs.CL·May 29, 2025

Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Erxin Yu, Jing Li, Ming Liao, Qi Zhu, Boyang Xue, Minghui Xu, Baojun Wang, Lanqing Hong, Fei Mi, Lifeng Shang

PDF

Open Access 1 Video

TL;DR

Self-Error-Instruct (SEI) is a novel framework that enhances large language models' mathematical reasoning by generating and utilizing generalized error-focused training data through iterative self-instruction and error clustering.

Contribution

The paper introduces SEI, a method that synthesizes targeted training data from model errors, improving reasoning performance beyond isolated bad cases.

Findings

01

Improved accuracy on GSM8K and MATH datasets.

02

Effective error type clustering enhances data quality.

03

Iterative fine-tuning boosts reasoning generalization.

Abstract

Although large language models demonstrate strong performance across various domains, they still struggle with numerous bad cases in mathematical reasoning. Previous approaches to learning from errors synthesize training data by solely extrapolating from isolated bad cases, thereby failing to generalize the extensive patterns inherent within these cases. This paper presents Self-Error-Instruct (SEI), a framework that addresses these model weaknesses and synthesizes more generalized targeted training data. Specifically, we explore a target model on two mathematical datasets, GSM8K and MATH, to pinpoint bad cases. Then, we generate error keyphrases for these cases based on the instructor model's (GPT-4o) analysis and identify error types by clustering these keyphrases. Next, we sample a few bad cases during each generation for each identified error type and input them into the instructor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning· underline

Taxonomy

TopicsTopic Modeling · Mathematics, Computing, and Information Processing · Natural Language Processing Techniques