MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics

Xinyu Liu; Zixuan Xie; Amir Moeini; Claire Chen; Shuze Daniel Liu; Yu Meng; Aidong Zhang; Shangtong Zhang

arXiv:2602.02561·cs.LO·May 12, 2026

MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics

Xinyu Liu, Zixuan Xie, Amir Moeini, Claire Chen, Shuze Daniel Liu, Yu Meng, Aidong Zhang, Shangtong Zhang

PDF

TL;DR

MathlibLemma introduces an LLM-based pipeline to automatically discover, formalize, and verify folklore lemmas, significantly enriching the Lean mathematical library with verified intermediate facts.

Contribution

The paper presents a novel modular pipeline for mining, formalizing, and verifying folklore lemmas, and creates a comprehensive benchmark for formal mathematics in Lean.

Findings

01

Produced 1,506 verified folklore lemmas in Lean

02

Merged a curated subset into Mathlib for external validation

03

Created a benchmark suite of 4,028 non-trivial Lean statements

Abstract

While the ecosystem of Lean and Mathlib has enjoyed celebrated success in formal mathematical reasoning with the help of large language models (LLMs), the absence of many folklore lemmas in Mathlib remains a persistent barrier that limits Lean's usability as an everyday tool for mathematicians like \LaTeX{} or Maple. To address this, we introduce MathlibLemma, a modular LLM-based pipeline for automated folklore-lemma mining: the discovery, formalization, and proving of reusable intermediate facts that mathematicians often take for granted but that are not always present in formal libraries. At its core, MathlibLemma proactively mines the missing connective tissue of mathematics. The pipeline produces a verified library of folklore-style lemmas, including 1,506 Lean-checked proofs that pass a proof-bypass screen; a small curated pilot subset has also been merged into Mathlib, providing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.