Certified Program Synthesis with a Multi-Modal Verifier
Yueyang Feng, Dipesh Kafle, Vladimir Gladshtein, Vitaly Kurin, George P\^irlea, Qiyuan Zhao, Peter M\"uller, Ilya Sergey

TL;DR
This paper introduces LeetProof, a multi-modal verifier-based framework for certified program synthesis that combines dynamic validation, automated proofs, and interactive scripting to improve certification success rates.
Contribution
It presents LeetProof, a novel agentic pipeline leveraging a multi-modal verifier to address specification defects and enhance synthesis across different verification paradigms.
Findings
LeetProof uncovers defects in existing benchmarks.
LeetProof achieves higher certification rates than single-mode baselines.
Validation with LeetProof improves synthesis reliability.
Abstract
Certified program synthesis (aka vericoding) is the process of automatically generating a program, its formal specification, and a machine-checkable proof of their alignment from a natural-language description. Two challenges make vericoding difficult. First, specifications synthesised from natural language are often either too weak to be meaningful or too strong to be implementable, yet existing approaches lack systematic means to detect such defects. Second, the landscape of program verifiers is fragmented: each tool supports a particular reasoning mode -- auto-active (e.g., Dafny, Verus) or interactive (e.g., Coq, Lean) -- with its own trade-off between automation and expressivity. This forces every synthesis methodology to be tailored to a single verification paradigm, limiting the class of tasks it can handle effectively. We overcome both challenges by structuring the certified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
