A Survey on Deep Learning for Theorem Proving
Zhaoyu Li, Jialiang Sun, Logan Murphy, Qidong Su, Zenan Li, Xian, Zhang, Kaiyu Yang, Xujie Si

TL;DR
This survey reviews recent advances in applying deep learning, especially large language models, to various aspects of theorem proving, highlighting datasets, evaluation metrics, challenges, and future directions.
Contribution
It provides a comprehensive overview of deep learning techniques in theorem proving, including approaches, datasets, evaluation, and critical analysis of challenges and future prospects.
Findings
Deep learning has significantly impacted theorem proving research.
Large language models are increasingly used for proof generation and formalization.
The survey identifies key challenges and promising future research directions.
Abstract
Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in natural language to rigorous derivations in formal systems. In recent years, the advancement of deep learning, especially the emergence of large language models, has sparked a notable surge of research exploring these techniques to enhance the process of theorem proving. This paper presents a comprehensive survey of deep learning for theorem proving by offering (i) a thorough review of existing approaches across various tasks such as autoformalization, premise selection, proofstep generation, and proof search; (ii) an extensive summary of curated datasets and strategies for synthetic data generation; (iii) a detailed analysis of evaluation metrics and the performance of state-of-the-art methods; and (iv) a critical discussion on the persistent challenges and the promising avenues for future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems · Mathematics, Computing, and Information Processing · Semantic Web and Ontologies
