TL;DR
Lean Atlas is a tool that visualizes Lean 4 formalizations to assist humans in verifying the semantic correctness of AI-generated proofs, reducing semantic errors in large projects.
Contribution
It introduces a human-in-the-loop approach with an interactive visualization and an algorithm for targeted semantic review in formal proofs.
Findings
Achieved 94-99% node reduction in proof-heavy projects
Reduced candidate set for semantic review by 59.8% in a milestone project
Demonstrated effectiveness across six diverse Lean 4 formalizations
Abstract
AI-driven autoformalization of mathematics is advancing rapidly. However, the type checker of a proof assistant guarantees only the logical correctness of proofs; it does not verify whether propositions and definitions faithfully capture their intended mathematical content. Consequently, AI-generated formal proofs can exhibit semantic hallucination-passing the type checker yet failing to express the intended mathematics. We propose a human-in-the-loop approach in which human scientists and AI collaboratively produce formal proofs, with humans responsible for the semantic verification of propositions and definitions. To realize this approach, we develop Lean Atlas, a Lean 4 tool that visualizes the dependency graph of a Lean 4 project as an interactive web viewer, enabling human scientists to grasp the overall structure of a formalization efficiently. Its core feature, Lean Compass, is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
