SITA: A Framework for Structure-to-Instance Theorem Autoformalization
Chenyi Li, Wanli Ma, Zichen Wang, Zaiwen Wen

TL;DR
This paper introduces SITA, a framework that automates the formalization of concrete mathematical theorems from abstract structures using large language models and Lean proof assistant, improving efficiency and correctness.
Contribution
SITA provides a systematic, modular approach for auto-formalizing theorems by bridging abstract theories and concrete instances with LLM-guided generation and verification.
Findings
Effective formalization of diverse instances in optimization problems
Integration of LLMs with feedback for correctness
Successful application in Lean proof assistant
Abstract
While large language models (LLMs) have shown progress in mathematical reasoning, they still face challenges in formalizing theorems that arise from instantiating abstract structures in concrete settings. With the goal of auto-formalizing mathematical results at the research level, we develop a framework for structure-to-instance theorem autoformalization (SITA), which systematically bridges the gap between abstract mathematical theories and their concrete applications in Lean proof assistant. Formalized abstract structures are treated as modular templates that contain definitions, assumptions, operations, and theorems. These templates serve as reusable guides for the formalization of concrete instances. Given a specific instantiation, we generate corresponding Lean definitions and instance declarations, integrate them using Lean's typeclass mechanism, and construct verified theorems by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems · Model-Driven Software Engineering Techniques · Constraint Satisfaction and Optimization
