130k Lines of Formal Topology in Two Weeks: Simple and Cheap Autoformalization for Everyone?
Josef Urban

TL;DR
This project demonstrates rapid, cost-effective autoformalization of extensive topology content using an LLM and proof checker, making formalization accessible and scalable for everyone.
Contribution
It introduces a simple, low-cost method for large-scale autoformalization combining LLMs with proof checkers, achieving over 130k lines in two weeks.
Findings
130k lines formalized in two weeks
Proofs of major theorems like Urysohn's lemma completed
Autoformalization approach is accessible and inexpensive
Abstract
This is a brief description of a project that has already autoformalized a large portion of the general topology from the Munkres textbook (which has in total 241 pages in 7 chapters and 39 sections). The project has been running since November 21, 2025 and has as of January 4, 2026, produced 160k lines of formalized topology. Most of it (about 130k lines) have been done in two weeks,from December 22 to January 4, for an LLM subscription cost of about $100. This includes a 3k-line proof of Urysohn's lemma, a 2k-line proof of Urysohn's Metrization theorem, over 10k-line proof of the Tietze extension theorem, and many more (in total over 1.5k lemmas/theorems). The approach is quite simple and cheap: build a long-running feedback loop between an LLM and a reasonably fast proof checker equipped with a core foundational library. The LLM is now instantiated as ChatGPT (mostly 5.2) or Claude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems · Computability, Logic, AI Algorithms · Philosophy and Theoretical Science
