Inferring multiple helper Dafny assertions with LLMs

\'Alvaro Silva; Alexandra Mendes; Ruben Martins

arXiv:2511.00125·cs.SE·November 4, 2025

Inferring multiple helper Dafny assertions with LLMs

\'Alvaro Silva, Alexandra Mendes, Ruben Martins

PDF

Open Access

TL;DR

This paper presents DAISY, a tool using LLMs to automatically infer missing helper assertions in Dafny programs, significantly reducing manual effort and improving verification scalability.

Contribution

The paper introduces a novel hybrid approach combining LLMs and heuristics for inferring multiple missing assertions in Dafny, supported by a new benchmark and taxonomy.

Findings

01

DAISY verifies 63.4% of single-assertion cases

02

DAISY verifies 31.7% of multiple-assertion cases

03

Many proofs can be completed with fewer assertions than originally provided

Abstract

The Dafny verifier provides strong correctness guarantees but often requires numerous manual helper assertions, creating a significant barrier to adoption. We investigate the use of Large Language Models (LLMs) to automatically infer missing helper assertions in Dafny programs, with a primary focus on cases involving multiple missing assertions. To support this study, we extend the DafnyBench benchmark with curated datasets where one, two, or all assertions are removed, and we introduce a taxonomy of assertion types to analyze inference difficulty. Our approach refines fault localization through a hybrid method that combines LLM predictions with error-message heuristics. We implement this approach in a new tool called DAISY (Dafny Assertion Inference SYstem). While our focus is on multiple missing assertions, we also evaluate DAISY on single-assertion cases. DAISY verifies 63.4% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Formal Methods in Verification · Logic, programming, and type systems