LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction
Robert Joseph George, Suozhi Huang, Peiyang Song, Anima Anandkumar

TL;DR
LeanProgress introduces a proof progress prediction method for neural theorem proving in Lean, significantly improving proof search efficiency and accuracy, especially for longer proofs, by estimating remaining proof steps.
Contribution
This work presents a novel proof progress prediction model trained on large Lean proof corpora, enhancing automated theorem proving with better proof search guidance.
Findings
Achieves 75.8% accuracy in progress prediction
Improves proof search efficiency by 3.8% on Mathlib4
Enhances handling of longer proofs in formal verification
Abstract
Mathematical reasoning remains a significant challenge for Large Language Models (LLMs) due to hallucinations. When combined with formal proof assistants like Lean, these hallucinations can be eliminated through rigorous verification, making theorem proving reliable. However, even with formal verification, LLMs still struggle with long proofs and complex mathematical formalizations. While Lean with LLMs offers valuable assistance with retrieving lemmas, generating tactics, or even complete proofs, it lacks a crucial capability: providing a sense of proof progress. This limitation particularly impacts the overall development efficiency in large formalization projects. We introduce LeanProgress, a method that predicts the progress in the proof. Training and evaluating our models made on a large corpus of Lean proofs from Lean Workbook Plus and Mathlib4 and how many steps remain to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing · Machine Learning in Materials Science · Logic, programming, and type systems
