Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs

Ido Pinto; Yizhak Yisrael Elboher; Haoze Wu; Nina Narodytska; Guy Katz

arXiv:2603.15510·cs.LG·March 17, 2026

Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs

Ido Pinto, Yizhak Yisrael Elboher, Haoze Wu, Nina Narodytska, Guy Katz

PDF

Open Access

TL;DR

This paper introduces a data curation pipeline that improves the training of Small Language Models for program invariant synthesis, significantly enhancing their accuracy and efficiency in verification tasks.

Contribution

It presents a novel data curation method, Wonda, that refines verifier-generated invariants for effective fine-tuning of SLMs, leading to substantial performance improvements.

Findings

01

Fine-tuned 4B SLM matches GPT-OSS-120B utility.

02

Approaches state-of-the-art GPT-5.2 without extra reasoning time.

03

Doubles correctness and speedup on InvBench instances.

Abstract

The synthesis of inductive loop invariants is a critical bottleneck in automated program verification. While Large Language Models (LLMs) show promise in mitigating this issue, they often fail on hard instances, generating invariants that are invalid or computationally ineffective. While fine-tuning is a natural route to mitigate this limitation, obtaining high-quality training data for invariant generation remains an open challenge. We present a rigorous data curation pipeline designed to extract high-quality training signals from raw verifier-generated invariants. First, we formalize the properties required for a high-quality training invariant. Second, we propose Wonda, a pipeline that refines noisy data via AST-based normalization, followed by LLM-driven semantic rewriting and augmentation with provable quality guarantees. We demonstrate that fine-tuning Small Language Models (SLMs)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · Adversarial Robustness in Machine Learning · Security and Verification in Computing