QEDCartographer: Automating Formal Verification Using Reward-Free   Reinforcement Learning

Alex Sanchez-Stern; Abhishek Varghese; Zhanna Kaufman; Dylan Zhang,; Talia Ringer; Yuriy Brun

arXiv:2408.09237·cs.SE·December 18, 2024

QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning

Alex Sanchez-Stern, Abhishek Varghese, Zhanna Kaufman, Dylan Zhang,, Talia Ringer, Yuriy Brun

PDF

Open Access

TL;DR

QEDCartographer introduces a reward-free reinforcement learning approach for automated proof synthesis in formal verification, significantly improving proof success rates and efficiency over prior methods.

Contribution

It combines supervised and reinforcement learning to explore proof space more effectively, overcoming sparse reward issues and incorporating proof structure for better automation.

Findings

01

Proves 21.4% of theorems automatically, outperforming prior tools.

02

Produces 34% shorter proofs and is 29% faster than the best previous method.

03

Together with CoqHammer, proves 30.3% of theorems, higher than CoqHammer alone.

Abstract

Formal verification is a promising method for producing reliable software, but the difficulty of manually writing verification proofs severely limits its utility in practice. Recent methods have automated some proof synthesis by guiding a search through the proof space using a theorem prover. Unfortunately, the theorem prover provides only the crudest estimate of progress, resulting in effectively undirected search. To address this problem, we create QEDCartographer, an automated proof-synthesis tool that combines supervised and reinforcement learning to more effectively explore the proof space. QEDCartographer incorporates the proofs' branching structure, enabling reward-free search and overcoming the sparse reward problem inherent to formal verification. We evaluate QEDCartographer using the CoqGym benchmark of 68.5K theorems from 124 open-source Coq projects. QEDCartographer fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques