Reducing the Costs of Proof Synthesis on Rust Systems by Scaling Up a Seed Training Set

Nongyu Di; Tianyu Chen; Shan Lu; Shuai Lu; Yeyun Gong; Peng Cheng; Jacob R. Lorch; Yuan Yao; Xiaoxing Ma

arXiv:2602.04910·cs.SE·May 12, 2026

Reducing the Costs of Proof Synthesis on Rust Systems by Scaling Up a Seed Training Set

Nongyu Di, Tianyu Chen, Shan Lu, Shuai Lu, Yeyun Gong, Peng Cheng, Jacob R. Lorch, Yuan Yao, Xiaoxing Ma

PDF

TL;DR

This paper introduces VeruSyn, a large-scale data synthesis pipeline for Rust verification, enabling the creation of a vast dataset of verified programs to improve proof synthesis in LLMs.

Contribution

The paper presents VeruSyn, a novel data synthesis method that significantly scales up verified Rust programs with proofs, enhancing LLM capabilities for formal verification tasks.

Findings

01

Generated 6.9 million verified Rust programs with proofs.

02

Fine-tuned a Qwen2.5-Coder model with improved proof capabilities.

03

Outperformed existing models like o4-mini in proof synthesis tasks.

Abstract

Large Language Models (LLMs) are widely used for code generation. However, the correctness of code generated by LLMs remains a concern. A potential remedy to this concern is to have LLMs generate formal correctness proofs along with such code. However, compared with code generation, code-proof generation requires much higher reasoning capability and has much less existing data to learn from. In this paper, we present VeruSyn, a data synthesis pipeline for Verus, a state-of-the-art verification tool for system software written in Rust. Through self-synthesis and tutorial-based synthesis, VeruSyn achieves much larger scale and Verus-feature coverage than previous data-synthesis techniques designed for Verus; VeruSyn also supplements its dataset with long-chain-of-thought (CoT) data through agent trajectory synthesis. With VeruSyn, we synthesize the largest set of Verus verified programs:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.