Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving
Xueliang Zhao, Wenda Li, Lingpeng Kong

TL;DR
This paper introduces a subgoal-based demonstration learning framework combined with diffusion models to improve formal theorem proving accuracy and efficiency using large language models.
Contribution
It proposes a novel subgoal construction and refinement method, and applies diffusion models for demonstration organization, achieving significant accuracy and efficiency improvements.
Findings
Proof accuracy increased from 38.9% to 44.3% on miniF2F.
Diffusion models improved demonstration organization, boosting accuracy to 45.5%.
Achieved a 5x increase in sampling efficiency.
Abstract
Large language models~(LLMs) present an intriguing avenue of exploration in the domain of formal theorem proving. Nonetheless, the full utilization of these models, particularly in terms of demonstration formatting and organization, remains an underexplored area. In an endeavor to enhance the efficacy of LLMs, we introduce a subgoal-based demonstration learning framework, consisting of two primary elements: Firstly, drawing upon the insights of subgoal learning from the domains of reinforcement learning and robotics, we propose the construction of distinct subgoals for each demonstration example and refine these subgoals in accordance with the pertinent theories of subgoal learning. Secondly, we build upon recent advances in diffusion models to predict the optimal organization, simultaneously addressing two intricate issues that persist within the domain of demonstration organization:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling
MethodsDiffusion
