A Lean Dataset for International Math Olympiad: Small Steps towards   Writing Math Proofs for Hard Problems

Roozbeh Yousefzadeh; Xuenan Cao; Azim Ospanov

arXiv:2411.18872·cs.LG·March 4, 2025

A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems

Roozbeh Yousefzadeh, Xuenan Cao, Azim Ospanov

PDF

Open Access 2 Datasets

TL;DR

This paper creates a new, small but comprehensive dataset of formal Lean proofs for IMO problems, aiming to facilitate AI development for automated proof writing and evaluation.

Contribution

It provides the first complete formal proofs for all IMO problems in Lean, along with a structured dataset of lemmas, enabling better AI training and assessment.

Findings

01

SOTA LLMs struggle with formal proof generation for IMO problems.

02

Decomposition of proofs into lemmas aids in evaluating AI performance.

03

The dataset offers a new benchmark for formal proof automation.

Abstract

Using AI to write formal proofs for mathematical problems is a challenging task that has seen some advancements in recent years. Automated systems such as Lean can verify the correctness of proofs written in formal language, yet writing the proofs in formal language can be challenging for humans and machines. The miniF2F benchmark has 20 IMO problems in its test set, yet formal proofs are available only for 6 of these problems (3 of which are only written by mathematicians). The model with best accuracy can only prove 2 of these 20 IMO problems, from 1950s and 60s, while its training set is a secret. In this work, we write complete, original formal proofs for the remaining IMO problems in Lean along with 3 extra problems from IMO 2022 and 2023. This effort expands the availability of proof currently in the public domain by creating 5,880 lines of Lean proof. The goal of the paper is to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Education · AI-based Problem Solving and Planning · Manufacturing Process and Optimization

MethodsByte Pair Encoding · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Adam · Attention Is All You Need · Softmax · Label Smoothing · Dropout · Linear Layer