Proving Theorems Recursively
Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang,, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang

TL;DR
POETRY introduces a recursive, level-by-level proof strategy in Isabelle that improves theorem proving success rates and proof lengths by focusing on verifiable sketches and deferring detailed proofs, outperforming existing methods.
Contribution
This paper presents POETRY, a novel recursive proof approach that enhances automated theorem proving by incrementally constructing proofs with verifiable sketches and intermediate conjectures.
Findings
5.1% improvement in success rate on miniF2F dataset
Maximum proof length increased from 10 to 26
Significant performance gains over state-of-the-art methods
Abstract
Recent advances in automated theorem proving leverages language models to explore expanded search spaces by step-by-step proof generation. However, such approaches are usually based on short-sighted heuristics (e.g., log probability or value function scores) that potentially lead to suboptimal or even distracting subgoals, preventing us from finding longer proofs. To address this challenge, we propose POETRY (PrOvE Theorems RecursivelY), which proves theorems in a recursive, level-by-level manner in the Isabelle theorem prover. Unlike previous step-by-step methods, POETRY searches for a verifiable sketch of the proof at each level and focuses on solving the current level's theorem or conjecture. Detailed proofs of intermediate conjectures within the sketch are temporarily replaced by a placeholder tactic called sorry, deferring their proofs to subsequent levels. This approach allows the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHistory and Theory of Mathematics
MethodsPrIme Sample Attention
