Evaluating the Performance of the DeepSeek Model in Confidential   Computing Environment

Ben Dong; Qian Wang

arXiv:2502.11347·cs.PF·February 18, 2025

Evaluating the Performance of the DeepSeek Model in Confidential Computing Environment

Ben Dong, Qian Wang

PDF

Open Access

TL;DR

This paper evaluates the performance of the DeepSeek large language model within Intel TDX-based confidential computing environments, demonstrating promising results for secure, resource-efficient AI deployment.

Contribution

First comprehensive performance evaluation of DeepSeek in TEE-enabled confidential computing, comparing CPU, GPU, and hybrid implementations for secure LLM inference.

Findings

01

TDX outperforms CPU in small model secure inference

02

GPU-to-CPU performance ratio averages 12 across models

03

Provides insights for optimizing confidential AI deployments

Abstract

The increasing adoption of Large Language Models (LLMs) in cloud environments raises critical security concerns, particularly regarding model confidentiality and data privacy. Confidential computing, enabled by Trusted Execution Environments (TEEs), offers a promising solution to mitigate these risks. However, existing TEE implementations, primarily CPU-based, struggle to efficiently support the resource-intensive nature of LLM inference and training. In this work, we present the first evaluation of the DeepSeek model within a TEE-enabled confidential computing environment, specifically utilizing Intel Trust Domain Extensions (TDX). Our study benchmarks DeepSeek's performance across CPU-only, CPU-GPU hybrid, and TEE-based implementations. For smaller parameter sets, such as DeepSeek-R1-1.5B, the TDX implementation outperforms the CPU version in executing computations within a secure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data