Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads
Aritra Dhar, Cl\'ement Thorens, Lara Magdalena Lazier, Lukas Cavigelli

TL;DR
Ascend-CC is a novel confidential computing architecture leveraging heterogeneous NPUs to securely run large language models without trusting the host system, ensuring data and model protection with minimal performance overhead.
Contribution
It introduces a hardware-agnostic, device-centric confidential computing solution for heterogeneous NPUs that secures AI workloads without requiring modifications to existing AI software stacks.
Findings
Achieves strong security guarantees for LLMs on NPU hardware.
Implements delegation-based memory semantics for isolation.
Demonstrates minimal overhead in Llama2 and Llama3 evaluations.
Abstract
Cloud workloads have dominated generative AI based on large language models (LLM). Specialized hardware accelerators, such as GPUs, NPUs, and TPUs, play a key role in AI adoption due to their superior performance over general-purpose CPUs. The AI models and the data are often highly sensitive and come from mutually distrusting parties. Existing CPU-based TEEs such as Intel SGX or AMD SEV do not provide sufficient protection. Device-centric TEEs like Nvidia-CC only address tightly coupled CPU-GPU systems with a proprietary solution requiring TEE on the host CPU side. On the other hand, existing academic proposals are tailored toward specific CPU-TEE platforms. To address this gap, we propose Ascend-CC, a confidential computing architecture based on discrete NPU devices that requires no trust in the host system. Ascend-CC provides strong security by ensuring data and model encryption…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Cloud Data Security Solutions · Chaos-based Image/Signal Encryption
