TL;DR
SEVerA is a framework that synthesizes self-evolving agents with formal guarantees of safety and correctness, combining formal specifications with learning to improve performance.
Contribution
The paper introduces FGGM and SEVerA, a novel three-stage framework that ensures verified, safe, and high-performing self-evolving agents using formal methods and learning.
Findings
SEVerA achieves zero constraint violations across multiple tasks.
It improves performance over unconstrained and state-of-the-art baselines.
Formal behavioral constraints guide synthesis toward higher-quality agents.
Abstract
Recent advances have shown the effectiveness of self-evolving LLM agents on tasks such as program repair and scientific discovery. In this paradigm, a planner LLM synthesizes an agent program that invokes parametric models, including LLMs, which are then tuned per task to improve performance. However, existing self-evolving agent frameworks provide no formal guarantees of safety or correctness. Because such programs are often executed autonomously on unseen inputs, this lack of guarantees raises reliability and security concerns. We formulate agentic code generation as a constrained learning problem, combining hard formal specifications with soft objectives capturing task utility. We introduce Formally Guarded Generative Models (FGGM), which allow the planner LLM to specify a formal output contract for each generative model call using first-order logic. Each FGGM call wraps the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
