OProver: A Unified Framework for Agentic Formal Theorem Proving

David Ma; Kaijing Ma; Shawn Guo; Yunfeng Shi; Enduo Zhao; Jiajun Shi; Zhaoxiang Zhang; Gavin Cheung; Jiaheng Liu; Zili Wang

arXiv:2605.17283·cs.CL·May 19, 2026

OProver: A Unified Framework for Agentic Formal Theorem Proving

David Ma, Kaijing Ma, Shawn Guo, Yunfeng Shi, Enduo Zhao, Jiajun Shi, Zhaoxiang Zhang, Gavin Cheung, Jiaheng Liu, Zili Wang

PDF

1 Repo 6 Models

TL;DR

OProver is a comprehensive framework for agentic formal theorem proving in Lean 4, utilizing iterative proof revision, retrieval, and reinforcement learning to achieve state-of-the-art results across multiple benchmarks.

Contribution

It introduces a unified agentic proving framework with iterative training, proof retrieval, and feedback mechanisms, significantly improving theorem proving performance.

Findings

01

Achieves top Pass@32 scores on MiniF2F, ProverBench, and PutnamBench.

02

Builds a large proof dataset with 1.77M statements and 6.86M proofs.

03

Outperforms prior open-weight provers on multiple benchmarks.

Abstract

Recent progress in formal theorem proving has benefited from large-scale proof generation and verifier-aware training, but agentic proving is rarely integrated into prover training, appearing only at inference time. We present OProver, a unified framework for agentic formal theorem proving in Lean 4, in which failed proof attempts are iteratively revised using retrieved compiler verified proofs and Lean compiler feedback. OProver is trained through continued pretraining followed by iterative post-training: each iteration runs agentic proving, indexes newly verified proofs into OProofs and the retrieval memory, uses repair trajectories as SFT data, and uses unresolved hard cases for RL. OProofs is built from public Lean resources, large-scale proof synthesis, and agentic proving traces, containing 1.77M Lean statements, 6.86M compiler-verified proofs, and serialized trajectories with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

multimodal-art-projection/OProver
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.