VeriAct: Beyond Verifiability -- Agentic Synthesis of Correct and Complete Formal Specifications

Md Rakib Hossain Misu; Iris Ma; Cristina V. Lopes

arXiv:2604.00280·cs.SE·April 2, 2026

VeriAct: Beyond Verifiability -- Agentic Synthesis of Correct and Complete Formal Specifications

Md Rakib Hossain Misu, Iris Ma, Cristina V. Lopes

PDF

TL;DR

This paper introduces VeriAct, a novel framework that uses verification-guided, agentic synthesis to generate correct and complete formal specifications, surpassing existing prompt-based methods.

Contribution

The paper presents VeriAct, an innovative agentic approach that iteratively synthesizes and repairs formal specifications using verification feedback, improving correctness and completeness.

Findings

01

Prompt optimization improves verifier pass rates but hits a performance ceiling.

02

Spec-Harness reveals many specifications accepted by verifiers are incorrect or incomplete.

03

VeriAct outperforms baseline methods in producing verifiable, correct, and complete specifications.

Abstract

Formal specifications play a central role in ensuring software reliability and correctness. However, automatically synthesizing high-quality formal specifications remains a challenging task, often requiring domain expertise. Recent work has applied large language models to generate specifications in Java Modeling Language (JML), reporting high verification pass rates. But does passing a verifier mean that the specification is actually correct and complete? In this work, we first conduct a comprehensive evaluation comparing classical and prompt-based approaches for automated JML specification synthesis. We then investigate whether prompt optimization can push synthesis quality further by evolving prompts through structured verification feedback. While optimization improves verifier pass rates, we find a clear performance ceiling. More critically, we propose Spec-Harness, an evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.