Enhancing Formal Theorem Proving: A Comprehensive Dataset for Training   AI Models on Coq Code

Andreas Florath

arXiv:2403.12627·cs.AI·April 3, 2024·1 cites

Enhancing Formal Theorem Proving: A Comprehensive Dataset for Training AI Models on Coq Code

Andreas Florath

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper introduces a large, comprehensive Coq dataset to improve AI models' ability to generate correct and meaningful Coq code, advancing automated theorem proving.

Contribution

The creation of a detailed Coq dataset from over 10,000 files to enhance LLM training for formal theorem proving tasks.

Findings

01

Models trained on the dataset showed improved Coq code generation accuracy.

02

A fine-tuned LLM generated 141 valid proofs for a basic lemma.

03

The dataset facilitates the discovery of diverse proof strategies.

Abstract

In the realm of formal theorem proving, the Coq proof assistant stands out for its rigorous approach to verifying mathematical assertions and software correctness. Despite the advances in artificial intelligence and machine learning, the specialized nature of Coq syntax and semantics poses unique challenges for Large Language Models (LLMs). Addressing this gap, we present a comprehensive dataset specifically designed to enhance LLMs' proficiency in interpreting and generating Coq code. This dataset, derived from a collection of over 10,000 Coq source files, encompasses a wide array of propositions, proofs, and definitions, enriched with metadata including source references and licensing information. Our primary aim is to facilitate the development of LLMs capable of generating syntactically correct and semantically meaningful Coq constructs, thereby advancing the frontier of automated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
florath/CoqLLM-FineTuned-Experiment-Gen0
model· 9 dl· ♡ 2
9 dl♡ 2

Datasets

florath/coq-facts-props-proofs-gen0-v1
dataset· 44 dl
44 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Computational Techniques and Applications · Scientific Computing and Data Management · AI-based Problem Solving and Planning