Evaluating the Ability of Large Language Models to Generate Verifiable Specifications in VeriFast
Wen Fan, Marilyn Rego, Xin Hu, Sanya Dod, Zhaorui Ni, Danning Xie,, Jenna DiVincenzo, Lin Tan

TL;DR
This paper evaluates GPT-4o's ability to generate verifiable specifications for C programs in VeriFast, revealing strengths in preserving behavior but challenges in verifiability and redundancy issues.
Contribution
It is the first study to assess large language models' effectiveness in generating ownership logic specifications for static verifiers like VeriFast.
Findings
GPT-4o preserves functional behavior in generated specifications.
Specifications often struggle to be verifiable.
Redundancies are common in verifiable specifications.
Abstract
Static verification is a powerful method for enhancing software quality, but it demands significant human labor and resources. This is particularly true of static verifiers that reason about heap manipulating programs using an ownership logic. LLMs have shown promise in a number of software engineering activities, including code generation, test generation, proof generation for theorem provers, and specification generation for static verifiers. However, prior work has not explored how well LLMs can perform specification generation for specifications based in an ownership logic, such as separation logic. To address this gap, this paper explores OpenAI's GPT-4o model's effectiveness in generating specifications on C programs that are verifiable with VeriFast, a separation logic based static verifier. Our experiment employs three different types of user inputs as well as basic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Business Process Modeling and Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Adam · Attention Dropout · Multi-Head Attention · Weight Decay · Byte Pair Encoding · Dropout
