Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design

Dong Ben; Hui Feng; Qian Wang

arXiv:2507.16226·cs.AI·July 23, 2025

Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design

Dong Ben, Hui Feng, Qian Wang

PDF

Open Access

TL;DR

This paper evaluates the performance of distilled large language models within confidential computing environments, demonstrating their suitability for resource-constrained system-on-chip applications in semiconductor design.

Contribution

It provides a comprehensive analysis of deploying lightweight LLMs in TEE environments like Intel TDX, highlighting performance gains with quantization and model distillation.

Findings

01

Distilled models outperform larger ones in resource-constrained environments.

02

Quantization (Q4, Q8) improves performance up to 3x over FP16.

03

TDX outperforms CPU in secure execution for smaller models.

Abstract

Large Language Models (LLMs) are increasingly used in circuit design tasks and have typically undergone multiple rounds of training. Both the trained models and their associated training data are considered confidential intellectual property (IP) and must be protected from exposure. Confidential Computing offers a promising solution to protect data and models through Trusted Execution Environments (TEEs). However, existing TEE implementations are not designed to support the resource-intensive nature of LLMs efficiently. In this work, we first present a comprehensive evaluation of the LLMs within a TEE-enabled confidential computing environment, specifically utilizing Intel Trust Domain Extensions (TDX). We constructed experiments on three environments: TEE-based, CPU-only, and CPU-GPU hybrid implementations, and evaluated their performance in terms of tokens per second. Our first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhysical Unclonable Functions (PUFs) and Hardware Security · Security and Verification in Computing · Cryptography and Data Security