Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

Kerri Prinos; Lilianne Brush; Cameron Denton; Zhanqi Wang; Joshua Knox; Snehal Antani; Anton Foltz; Amy Villase\~nor

arXiv:2605.03034·cs.AI·May 6, 2026

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

Kerri Prinos, Lilianne Brush, Cameron Denton, Zhanqi Wang, Joshua Knox, Snehal Antani, Anton Foltz, Amy Villase\~nor

PDF

TL;DR

This paper introduces a tool-mediated LLM architecture with formal guarantees for autonomous cyber defense, demonstrating stability and robustness against adversarial attacks in real enterprise scenarios.

Contribution

It presents a novel architecture combining deterministic tools and formal certification to ensure controllability and robustness in high-stake decision-making under adversarial conditions.

Findings

01

Reduces attacker's expected payoff by 59% in real attack graphs

02

Certifies controllability and stability using formal methods in Lean 4

03

Demonstrates architectural stability regardless of controller capability

Abstract

Agentic systems involved in high-stake decision-making under adversarial pressure need formal guarantees not offered by existing approaches. Motivated by the operational needs of security operations centers (SOCs) that must configure endpoint detection and response (EDR) policies under adversarial pressure, we present a tool-mediated architecture: LLM agents use deterministic tools (Stackelberg best-response, Bayesian observer updates, attack-graph primitives) and select from finite action catalogs enforced at the tool-output interface. A composite Lyapunov function machine-checked in Lean 4 with zero sorry certifies controllability, observability from asymmetric sensor data, and Input-to-State Stability (ISS) robustness under intelligent adversarial disturbance, with two corollaries extending the certificate to any controller or adversary from the catalogs. On 282 real enterprise…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.