GPU Acceleration of Sparse Fully Homomorphic Encrypted DNNs

Lara D'Agata; Carlos Agull\'o-Domingo; \'Oscar Vera-L\'opez; Kaustubh Shivdikar; Ardhi W. B. Yudha; Ferhat Yaman; David Kaeli; Jos\'e L. Abell\'an; Ian Colbert; Jos\'e Cano

arXiv:2604.11659·cs.CR·April 14, 2026

GPU Acceleration of Sparse Fully Homomorphic Encrypted DNNs

Lara D'Agata, Carlos Agull\'o-Domingo, \'Oscar Vera-L\'opez, Kaustubh Shivdikar, Ardhi W. B. Yudha, Ferhat Yaman, David Kaeli, Jos\'e L. Abell\'an, Ian Colbert, Jos\'e Cano

PDF

TL;DR

This paper presents a GPU-accelerated method for efficient sparse matrix multiplication in fully homomorphic encryption, significantly improving performance and complexity for deep neural network computations.

Contribution

It introduces a novel GPU-based sparse FHE matrix multiplication technique that outperforms CPU methods and reduces computational complexity.

Findings

01

GPU implementation outperforms CPU by up to 3.0×

02

Reduces time complexity from cubic to semi-linear

03

Demonstrates practical acceleration for FHE-based neural network inference

Abstract

Fully homomorphic encryption (FHE) has recently attracted significant attention as both a cryptographic primitive and a systems challenge. Given the latest advances in accelerated computing, FHE presents a promising opportunity for progress, with applications ranging from machine learning to information security. We target the most computationally intensive operation in deep neural networks from a hardware perspective, matrix multiplication (matmul), and adapt it for execution on AMD GPUs. We propose a new optimized method that improves the runtime and complexity of ciphertext matmul by using FIDESlib, a recent open-source FHE library designed specifically for GPUs. By exploiting sparsity in both operands, our sparse matmul implementation outperforms its CPU counterpart by up to $3.0 \times$ and reduces the time complexity from cubic to semi-linear, demonstrating an improvement over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.