BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving
Ran Xin, Chenguang Xi, Jie Yang, Feng Chen, Hang Wu, Xia Xiao, Yifan Sun, Shen Zheng, Kai Shen

TL;DR
BFS-Prover demonstrates that a well-scaled, simpler best-first tree search method can achieve state-of-the-art performance in large-scale theorem proving with LLMs, challenging the reliance on more complex search strategies.
Contribution
The paper introduces BFS-Prover, a scalable best-first tree search framework with strategic data filtering, preference optimization, and length normalization, achieving competitive results in theorem proving.
Findings
Achieved 72.95% accuracy on MiniF2F test set.
Challenged the necessity of complex tree search methods.
Open-sourced the BFS-Prover model for community use.
Abstract
Recent advancements in large language models (LLMs) have spurred growing interest in automatic theorem proving using Lean4, where effective tree search methods are crucial for navigating the underlying large proof search spaces. While the existing approaches primarily rely on value functions and/or Monte Carlo Tree Search (MCTS), the potential of simpler methods like Best-First Tree Search (BFS) remains underexplored. In this paper, we investigate whether BFS can achieve competitive performance in large-scale theorem proving tasks. We present BFS-Prover, a scalable expert iteration framework, featuring three key innovations. First, we implement strategic data filtering at each expert iteration round, excluding problems solvable via beam search node expansion to focus on harder cases. Second, we improve the sample efficiency of BFS through Direct Preference Optimization (DPO) applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems · Mathematics, Computing, and Information Processing · Advanced Database Systems and Queries
MethodsSparse Evolutionary Training · Focus
