A Minimal Agent for Automated Theorem Proving

Borja Requena; Austin Letson; Krystian Nowakowski; Izan Beltran-Ferreiro; Leopoldo Sarra

arXiv:2602.24273·cs.AI·May 15, 2026

A Minimal Agent for Automated Theorem Proving

Borja Requena, Austin Letson, Krystian Nowakowski, Izan Beltran-Ferreiro, Leopoldo Sarra

PDF

TL;DR

This paper introduces a simple, open-source agentic framework for automated theorem proving that achieves competitive results with less complexity and cost than existing systems.

Contribution

It presents a minimal, modular agentic architecture for theorem proving, enabling systematic comparison and demonstrating advantages of iterative proof refinement.

Findings

01

Competitive performance with simpler architecture

02

Iterative approach outperforms single-shot methods

03

Cost-effective and sample-efficient proof generation

Abstract

We propose a minimal agentic baseline that enables systematic comparison across different AI-based theorem prover architectures. This design implements the core features shared among state-of-the-art systems: iterative proof refinement, library search and context management. We evaluate this agentic approach using qualitatively different benchmarks and compare various frontier language models and design choices. Our results show competitive performance compared to state-of-the-art approaches, while using a significantly simpler architecture and a fraction of their cost. Additionally, we demonstrate consistent advantages of an iterative approach over multiple single-shot generations, especially in terms of sample efficiency and cost effectiveness. The implementation is released open-source as a candidate reference for future research and as an accessible prover for the community.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.