Autoformalization with Large Language Models

Yuhuai Wu; Albert Q. Jiang; Wenda Li; Markus N. Rabe; Charles Staats,; Mateja Jamnik; Christian Szegedy

arXiv:2205.12615·cs.LG·May 26, 2022·41 cites

Autoformalization with Large Language Models

Yuhuai Wu, Albert Q. Jiang, Wenda Li, Markus N. Rabe, Charles Staats,, Mateja Jamnik, Christian Szegedy

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that large language models can effectively autoformalize a significant portion of mathematical problems, improving theorem proving performance and advancing the field of automated formalization.

Contribution

It shows that LLMs can accurately translate natural language mathematics into formal specifications, achieving state-of-the-art results in theorem proving benchmarks.

Findings

01

LLMs can correctly autoformalize 25.3% of problems.

02

Autoformalization improves neural theorem prover performance.

03

State-of-the-art proof rate of 35.2% on MiniF2F.

Abstract

Autoformalization is the process of automatically translating from natural language mathematics to formal specifications and proofs. A successful autoformalization system could advance the fields of formal verification, program synthesis, and artificial intelligence. While the long-term goal of autoformalization seemed elusive for a long time, we show large language models provide new prospects towards this goal. We make the surprising observation that LLMs can correctly translate a significant portion ( $25.3%$ ) of mathematical competition problems perfectly to formal specifications in Isabelle/HOL. We demonstrate the usefulness of this process by improving a previously introduced neural theorem prover via training on these autoformalized theorems. Our methodology results in a new state-of-the-art result on the MiniF2F theorem proving benchmark, improving the proof rate from $29.6%$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Autoformalization with Large Language Models· slideslive

Taxonomy

TopicsMathematics, Computing, and Information Processing · Logic, programming, and type systems · Software Engineering Research