Position: A Three-Layer Probabilistic Assume-Guarantee Architecture Is Structurally Required for Safe LLM Agent Deployment

S.Bensalem; Y. Dong; M. Franzle; X. Huang; J. Kroger; D. Nickovic; A. Nouri; R. Roy; C. Wu

arXiv:2605.18672·cs.AI·May 19, 2026

Position: A Three-Layer Probabilistic Assume-Guarantee Architecture Is Structurally Required for Safe LLM Agent Deployment

S.Bensalem, Y. Dong, M. Franzle, X. Huang, J. Kroger, D. Nickovic, A. Nouri, R. Roy, C. Wu

PDF

TL;DR

The paper argues for a three-layer probabilistic architecture for safe LLM agent deployment, emphasizing the need for independent safety guarantees across different execution stages.

Contribution

It proposes a contract-based, three-layer architecture with probabilistic guarantees, addressing limitations of single-layer safety approaches for LLM agents.

Findings

01

A three-layer architecture can provide compositional safety guarantees.

02

Probabilistic safety bounds can be derived using the chain rule of probability.

03

Identifies key open problems for deploying this architecture in practice.

Abstract

This position paper argues that enforcing LLM agent safety within a single abstraction layer is not merely suboptimal but categorically insufficient for deployed LLM agents -- a structural consequence of how agent execution works, not a contingent limitation of current systems. The three dimensions that jointly constitute safe operation -- semantic intent and policy compliance, environmental validity, and dynamical feasibility -- each depend on a strictly distinct set of information that becomes available at different stages of execution. No single guardrail can certify all three. We argue that the community must respond with a contract-based architecture in which each safety dimension is enforced by an independently certified layer whose probabilistic guarantee satisfies the next layer's assumption. We sketch such an architecture and derive the compositional system-level safety bounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.