ProofWala: Multilingual Proof Data Synthesis and Theorem-Proving
Amitayush Thakur, George Tsoukalas, Greg Durrett, Swarat Chaudhuri

TL;DR
This paper introduces ProofWala, a multilingual framework for neural theorem-proving that enables cross-ITP transfer and improves proof success rates through multilingual training and standardized interaction protocols.
Contribution
ProofWala provides a novel multilingual proof data synthesis framework and standardized interaction interface for neural theorem-provers across Coq and Lean, facilitating transfer learning.
Findings
Multilingual training improves transfer performance across ITPs.
ProofWala enables systematic evaluation of neural provers on multiple ITPs.
Models trained on multilingual data outperform single-ITP models on proof success metrics.
Abstract
Neural networks have shown substantial promise at automatic theorem-proving in interactive proof assistants (ITPs) like Lean and Coq. However, most neural theorem-proving models are restricted to specific ITPs, leaving out opportunities for cross-lingual between ITPs. We address this weakness with a multilingual proof framework, , that allows a standardized form of interaction between neural theorem-provers and two established ITPs (Coq and Lean). It enables the collection of multilingual proof step data -- data recording the result of proof actions on ITP states -- for training neural provers. allows the systematic evaluation of a model's performance across different ITPs and problem domains via efficient parallel proof search algorithms. We show that multilingual training enabled by ${\rm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Handwritten Text Recognition Techniques
