Secure eFPGA-Enabled Edge LLM Inference: Architectural and Hardware Countermeasures

Voktho Das; M Zafir Sadik Khan; Jafar Vafaei; Kimia Azar; Hadi Kamali

arXiv:2604.22935·cs.CR·April 28, 2026

Secure eFPGA-Enabled Edge LLM Inference: Architectural and Hardware Countermeasures

Voktho Das, M Zafir Sadik Khan, Jafar Vafaei, Kimia Azar, Hadi Kamali

PDF

TL;DR

This paper proposes a hybrid ASIC+eFPGA architecture for edge transformer inference that enhances security against side-channel, fault injection, and supply-chain attacks while maintaining high performance.

Contribution

It introduces a novel integrated architecture combining ASIC and eFPGA to enable security mechanisms like runtime monitoring and patching for edge LLM inference.

Findings

01

Enhanced security against side-channel and fault injection attacks.

02

Maintained high performance of ASIC-based transformer inference.

03

Enabled post-deployment security updates via reconfigurable logic.

Abstract

Edge deployment of transformer-based models increasingly relies on ASIC accelerators due to their high performance and energy efficiency, achieved through optimized dataflows, specialized architectures, low-bitwidth computation, and efficient memory hierarchies. However, these advantages come with significant security vulnerabilities. ASIC-based DNN accelerators are susceptible to side-channel attacks (e.g., power, electromagnetic, and timing analysis) and fault injection attacks (e.g., voltage manipulation, clock glitches, and memory perturbations), which can lead to model extraction or compromised inference integrity. Furthermore, threats introduced during design and fabrication, such as hardware Trojans or untrusted third-party IPs, further expand the attack surface. To address these challenges, we explore a hybrid ASIC+eFPGA architecture that combines the efficiency of ASICs with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.