Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Jiangjie Chen; Wenxiang Chen; Jiacheng Du; Jinyi Hu; Zhicheng Jiang; Allan Jie; Xiaoran Jin; Xing Jin; Chenggang Li; Wenlei Shi; Zhihong Wang; Mingxuan Wang; Chenrui Wei; Shufa Wei; Huajian Xin; Fan Yang; Weihao Gao; Zheng Yuan; Tianyang Zhan; Zeyu Zheng; Tianxi Zhou; Thomas Hanwen Zhu

arXiv:2512.17260·cs.CL·December 22, 2025

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Jiangjie Chen, Wenxiang Chen, Jiacheng Du, Jinyi Hu, Zhicheng Jiang, Allan Jie, Xiaoran Jin, Xing Jin, Chenggang Li, Wenlei Shi, Zhihong Wang, Mingxuan Wang, Chenrui Wei, Shufa Wei, Huajian Xin, Fan Yang, Weihao Gao, Zheng Yuan, Tianyang Zhan, Zeyu Zheng, Tianxi Zhou

PDF

Open Access

TL;DR

Seed-Prover 1.5 leverages reinforcement learning and experience accumulation to enhance formal theorem proving, achieving high success rates on undergraduate to PhD-level problems efficiently.

Contribution

The paper introduces Seed-Prover 1.5, a novel formal theorem-proving model trained with large-scale agentic reinforcement learning and an efficient test-time scaling workflow.

Findings

01

Solves 88% of PutnamBench problems

02

Achieves 80% success on Fate-H

03

Solves 11 of 12 Putnam 2025 problems within 9 hours

Abstract

Large language models have recently made significant progress to generate rigorous mathematical proofs. In contrast, utilizing LLMs for theorem proving in formal languages (such as Lean) remains challenging and computationally expensive, particularly when addressing problems at the undergraduate level and beyond. In this work, we present \textbf{Seed-Prover 1.5}, a formal theorem-proving model trained via large-scale agentic reinforcement learning, alongside an efficient test-time scaling (TTS) workflow. Through extensive interactions with Lean and other tools, the model continuously accumulates experience during the RL process, substantially enhancing the capability and efficiency of formal theorem proving. Furthermore, leveraging recent advancements in natural language proving, our TTS workflow efficiently bridges the gap between natural and formal languages. Compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · Machine Learning in Materials Science · Mathematics, Computing, and Information Processing