Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal   Theorem Proving

Xueliang Zhao; Wenda Li; Lingpeng Kong

arXiv:2305.16366·cs.CL·May 29, 2023·2 cites

Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving

Xueliang Zhao, Wenda Li, Lingpeng Kong

PDF

Open Access 1 Repo

TL;DR

This paper introduces a subgoal-based demonstration learning framework combined with diffusion models to improve formal theorem proving accuracy and efficiency using large language models.

Contribution

It proposes a novel subgoal construction and refinement method, and applies diffusion models for demonstration organization, achieving significant accuracy and efficiency improvements.

Findings

01

Proof accuracy increased from 38.9% to 44.3% on miniF2F.

02

Diffusion models improved demonstration organization, boosting accuracy to 45.5%.

03

Achieved a 5x increase in sampling efficiency.

Abstract

Large language models~(LLMs) present an intriguing avenue of exploration in the domain of formal theorem proving. Nonetheless, the full utilization of these models, particularly in terms of demonstration formatting and organization, remains an underexplored area. In an endeavor to enhance the efficacy of LLMs, we introduce a subgoal-based demonstration learning framework, consisting of two primary elements: Firstly, drawing upon the insights of subgoal learning from the domains of reinforcement learning and robotics, we propose the construction of distinct subgoals for each demonstration example and refine these subgoals in accordance with the pertinent theories of subgoal learning. Secondly, we build upon recent advances in diffusion models to predict the optimal organization, simultaneously addressing two intricate issues that persist within the domain of demonstration organization:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkunlp/subgoal-theorem-prover
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling

MethodsDiffusion