G\"odel's Poetry

Kelly J. Davis

arXiv:2512.14252·cs.AI·December 17, 2025

G\"odel's Poetry

Kelly J. Davis

PDF

Open Access

TL;DR

This paper presents a novel multi-agent system employing specialized language models and recursive decomposition for automated theorem proving in Lean4, significantly improving success rates over previous methods.

Contribution

It introduces a multi-agent architecture that combines autoformalization, proof generation, and recursive theorem decomposition, extending the Kimina Lean Server with AST parsing capabilities.

Findings

01

Achieves 90.4% pass rate on miniF2F without decomposition

02

Significant improvement with theorem decomposition

03

Open-source implementation available on GitHub and PyPI

Abstract

Formal, automated theorem proving has long been viewed as a challenge to artificial intelligence. We introduce here a new approach to computer theorem proving, one that employs specialized language models for Lean4 proof generation combined with recursive decomposition of difficult theorems into simpler entailing propositions. These models are coordinated through a multi-agent architecture that orchestrates autoformalization (if required), proof generation, decomposition of difficult theorems into simpler entailing propositions, and recursive proof (and/or decomposition) of these propositions. Without decomposition, we achieve a 90.4% pass rate on miniF2F. With decomposition, this is significantly improved. A key technical contribution lies in our extension of the Kimina Lean Server with abstract syntax tree (AST) parsing capabilities to facilitate automated, recursive proof…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · Formal Methods in Verification · Computability, Logic, AI Algorithms