Learning to Reason with Insight for Informal Theorem Proving

Yunhe Li; Hao Shi; Bowen Deng; Wei Wang; Mengzhe Ruan; Hanxu Hou; Zhongxiang Dai; Siyang Gao; Chao Wang; Shuang Qiu; Linqi Song

arXiv:2604.16278·cs.AI·April 20, 2026

Learning to Reason with Insight for Informal Theorem Proving

Yunhe Li, Hao Shi, Bowen Deng, Wei Wang, Mengzhe Ruan, Hanxu Hou, Zhongxiang Dai, Siyang Gao, Chao Wang, Shuang Qiu, Linqi Song

PDF

TL;DR

This paper introduces a new framework and dataset to enhance large language models' ability to perform insightful reasoning in informal theorem proving, significantly improving their mathematical problem-solving skills.

Contribution

It presents DeepInsightTheorem, a hierarchical dataset of informal proofs, and a multi-stage training strategy to teach models core techniques and insightful reasoning.

Findings

01

Insight-aware training improves reasoning accuracy

02

Hierarchical dataset helps models identify core techniques

03

Progressive learning mimics human reasoning process

Abstract

Although most of the automated theorem-proving approaches depend on formal proof systems, informal theorem proving can align better with large language models' (LLMs) strength in natural language processing. In this work, we identify a primary bottleneck in informal theorem proving as a lack of insight, namely the difficulty of recognizing the core techniques required to solve complex problems. To address this, we propose a novel framework designed to cultivate this essential reasoning skill and enable LLMs to perform insightful reasoning. We propose $DeepInsightTheorem$ , a hierarchical dataset that structures informal proofs by explicitly extracting core techniques and proof sketches alongside the final proof. To fully exploit this dataset, we design a Progressive Multi-Stage SFT strategy that mimics the human learning process, guiding the model from basic proof writing to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.