SEVerA: Verified Synthesis of Self-Evolving Agents

Debangshu Banerjee; Changming Xu; Eugene Ie; Ming Zhang; Daiyi Peng; Chu-Cheng Lin; Gagandeep Singh

arXiv:2603.25111·cs.LG·April 28, 2026

SEVerA: Verified Synthesis of Self-Evolving Agents

Debangshu Banerjee, Changming Xu, Eugene Ie, Ming Zhang, Daiyi Peng, Chu-Cheng Lin, Gagandeep Singh

PDF

1 Repo

TL;DR

SEVerA is a framework that synthesizes self-evolving agents with formal guarantees of safety and correctness, combining formal specifications with learning to improve performance.

Contribution

The paper introduces FGGM and SEVerA, a novel three-stage framework that ensures verified, safe, and high-performing self-evolving agents using formal methods and learning.

Findings

01

SEVerA achieves zero constraint violations across multiple tasks.

02

It improves performance over unconstrained and state-of-the-art baselines.

03

Formal behavioral constraints guide synthesis toward higher-quality agents.

Abstract

Recent advances have shown the effectiveness of self-evolving LLM agents on tasks such as program repair and scientific discovery. In this paradigm, a planner LLM synthesizes an agent program that invokes parametric models, including LLMs, which are then tuned per task to improve performance. However, existing self-evolving agent frameworks provide no formal guarantees of safety or correctness. Because such programs are often executed autonomously on unseen inputs, this lack of guarantees raises reliability and security concerns. We formulate agentic code generation as a constrained learning problem, combining hard formal specifications with soft objectives capturing task utility. We introduce Formally Guarded Generative Models (FGGM), which allow the planner LLM to specify a formal output contract for each generative model call using first-order logic. Each FGGM call wraps the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uiuc-focal-lab/severa
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.