LemmaHead: RAG Assisted Proof Generation Using Large Language Models

Tianbo Yang; Mingqi Yan; Hongyi Zhao; Tianshuo Yang

arXiv:2501.15797·cs.LG·February 11, 2025

LemmaHead: RAG Assisted Proof Generation Using Large Language Models

Tianbo Yang, Mingqi Yan, Hongyi Zhao, Tianshuo Yang

PDF

Open Access

TL;DR

LemmatHead leverages retrieval augmented generation to enhance large language models' ability to generate mathematical proofs, focusing on theorem proving in the Lean language by supplementing models with relevant textbook context.

Contribution

This work introduces LemmaHead, a RAG-based knowledge base that improves mathematical reasoning in LLMs by integrating relevant textbook information for proof generation.

Findings

01

Improved proof generation accuracy in Lean theorem proving

02

Effective integration of textbook context enhances reasoning

03

Demonstrated benefits of RAG in mathematical tasks

Abstract

Developing the logic necessary to solve mathematical problems or write mathematical proofs is one of the more difficult objectives for large language models (LLMS). Currently, the most popular methods in literature consists of fine-tuning the model on written mathematical content such as academic publications and textbooks, so that the model can learn to emulate the style of mathematical writing. In this project, we explore the effectiveness of using retrieval augmented generation (RAG) to address gaps in the mathematical reasoning of LLMs. We develop LemmaHead, a RAG knowledge base that supplements queries to the model with relevant mathematical context, with particular focus on context from published textbooks. To measure our model's performance in mathematical reasoning, our testing paradigm focuses on the task of automated theorem proving via generating proofs to a given…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Handwritten Text Recognition Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Dense Connections · Softmax · Linear Warmup With Linear Decay · Adam · Residual Connection · Dropout · Byte Pair Encoding · WordPiece