FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Xiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu,, Zhengying Liu, Linqi Song, Xiaodan Liang

TL;DR
FVEL introduces an interactive environment that combines large language models with formal theorem proving in Isabelle to enhance the flexibility and effectiveness of formal verification for code.
Contribution
The paper presents FVEL, a novel environment integrating LLMs with Isabelle for automated theorem proving, supported by a large-scale dataset FVELER for training and benchmarking.
Findings
FVEL with fine-tuned LLMs solves more verification problems.
The approach reduces proof errors compared to baseline methods.
Benchmark results show significant improvement in problem-solving capability.
Abstract
Formal verification (FV) has witnessed growing significance with current emerging program synthesis by the evolving large language models (LLMs). However, current formal verification mainly resorts to symbolic verifiers or hand-craft rules, resulting in limitations for extensive and flexible verification. On the other hand, formal languages for automated theorem proving, such as Isabelle, as another line of rigorous verification, are maintained with comprehensive rules and theorems. In this paper, we propose FVEL, an interactive Formal Verification Environment with LLMs. Specifically, FVEL transforms a given code to be verified into Isabelle, and then conducts verification via neural automated theorem proving with an LLM. The joined paradigm leverages the rigorous yet abundant formulated and organized rules in Isabelle and is also convenient for introducing and adjusting cutting-edge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Model-Driven Software Engineering Techniques · Semantic Web and Ontologies
