Advancing Mathematical Research via Human-AI Interactive Theorem Proving
Chenyi Li, Zhijian Lai, Dong An, Jiang Hu, Zaiwen Wen

TL;DR
This paper presents a human-AI collaborative workflow for mathematical theorem proving that leverages large language models to enhance research efficiency while ensuring mathematical rigor.
Contribution
It introduces a novel human-in-the-loop approach integrating LLMs into theorem proving and scientific computing workflows, demonstrated through a case study on quantum algorithms.
Findings
LLMs assist in proof search and theorem discovery.
The workflow accelerates exploration of mathematical structures.
Framework maintains transparency and rigor in AI-assisted research.
Abstract
We investigate how large language models can be used as research tools in scientific computing while preserving mathematical rigor. We propose a human-in-the-loop workflow for interactive theorem proving and discovery with LLMs. Human experts retain control over problem formulation and admissible assumptions, while the model searches for proofs or contradictions, proposes candidate properties and theorems, and helps construct structures and parameters that satisfy explicit constraints, supported by numerical experiments and simple verification checks. Experts treat these outputs as raw material, further refine them, and organize the results into precise statements and rigorous proofs. We instantiate this workflow in a case study on the connection between manifold optimization and Grover's quantum search algorithm, where the pipeline helps identify invariant subspaces, explore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Logic, programming, and type systems · Constraint Satisfaction and Optimization
