When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI

Javad Forough; Marios Kogias; Hamed Haddadi

arXiv:2605.03213·cs.CR·May 8, 2026

When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI

Javad Forough, Marios Kogias, Hamed Haddadi

PDF

TL;DR

This survey explores how confidential computing, especially Trusted Execution Environments, can enhance security in agentic AI systems that handle sensitive data and operate across distributed platforms.

Contribution

It provides a comprehensive taxonomy of TEE platforms, an agent-centric threat model, a comparative analysis of CC defenses, and outlines open challenges for secure agentic AI deployment.

Findings

01

Six TEE platforms are systematically compared for deployment and performance.

02

Current CC defenses transfer well to single-call inference but need adaptation for agentic AI.

03

No complete end-to-end security framework for agentic AI using confidential computing exists yet.

Abstract

Agentic AI systems, specifically LLM-driven agents that plan, invoke tools, maintain persistent memory, and delegate tasks to peer agents via protocols such as MCP and A2A, introduce a threat surface that differs materially from standalone model inference. Agents accumulate sensitive context, hold credentials, and operate across pipelines no single party fully controls, enabling prompt injection, context exfiltration, credential theft, and inter-agent message poisoning. Current defenses operate entirely within the software stack and can be silently bypassed by a sufficiently privileged adversary such as a compromised cloud operator. Confidential computing (CC) offers a hardware-rooted alternative: Trusted Execution Environments (TEEs) isolate agent code and data from privileged system software, while remote attestation enables verifiable trust across distributed deployments. This survey…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.