Learning to Repair Lean Proofs from Compiler Feedback

Evan Wang; Simon Chess; Daniel Lee; Siyuan Ge; Ajit Mallavarapu; Jarod Alper; Vasily Ilin

arXiv:2602.02990·cs.LG·March 17, 2026

Learning to Repair Lean Proofs from Compiler Feedback

Evan Wang, Simon Chess, Daniel Lee, Siyuan Ge, Ajit Mallavarapu, Jarod Alper, Vasily Ilin

PDF

Open Access 2 Models 1 Datasets

TL;DR

This paper introduces APRIL, a large dataset for training neural models to repair Lean proofs using compiler feedback, significantly improving proof repair accuracy and reasoning.

Contribution

We created APRIL, a dataset of 260,000 proof failures with diagnostics and repairs, enabling supervised learning for proof repair from compiler feedback.

Findings

01

Finetuned models outperform open-source baselines in proof repair.

02

Training on APRIL improves feedback-conditioned reasoning.

03

The dataset enhances neural theorem prover capabilities.

Abstract

As neural theorem provers become increasingly agentic, the ability to interpret and act on compiler feedback is critical. However, existing Lean datasets consist almost exclusively of correct proofs, offering little supervision for understanding and repairing failures. We study Lean proof repair as a supervised learning problem: given an erroneous proof and compiler feedback, predict both a corrected proof and a natural-language diagnosis grounded in the same feedback. We introduce APRIL (Automated Proof Repair in Lean), a dataset of 260,000 supervised tuples pairing systematically generated proof failures with compiler diagnostics and aligned repair and explanation targets. Training language models on APRIL substantially improves repair accuracy and feedback-conditioned reasoning; in our single-shot repair evaluation setting, a finetuned 4B-parameter model outperforms the strongest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

uw-math-ai/APRIL
dataset· 110 dl
110 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · Software Engineering Research · Software Testing and Debugging Techniques