When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI
Javad Forough, Marios Kogias, Hamed Haddadi

TL;DR
This survey explores how confidential computing, especially Trusted Execution Environments, can enhance security in agentic AI systems that handle sensitive data and operate across distributed platforms.
Contribution
It provides a comprehensive taxonomy of TEE platforms, an agent-centric threat model, a comparative analysis of CC defenses, and outlines open challenges for secure agentic AI deployment.
Findings
Six TEE platforms are systematically compared for deployment and performance.
Current CC defenses transfer well to single-call inference but need adaptation for agentic AI.
No complete end-to-end security framework for agentic AI using confidential computing exists yet.
Abstract
Agentic AI systems, specifically LLM-driven agents that plan, invoke tools, maintain persistent memory, and delegate tasks to peer agents via protocols such as MCP and A2A, introduce a threat surface that differs materially from standalone model inference. Agents accumulate sensitive context, hold credentials, and operate across pipelines no single party fully controls, enabling prompt injection, context exfiltration, credential theft, and inter-agent message poisoning. Current defenses operate entirely within the software stack and can be silently bypassed by a sufficiently privileged adversary such as a compromised cloud operator. Confidential computing (CC) offers a hardware-rooted alternative: Trusted Execution Environments (TEEs) isolate agent code and data from privileged system software, while remote attestation enables verifiable trust across distributed deployments. This survey…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
