Towards Deep Encrypted Training: Low-Latency, Memory-Efficient, and High-Throughput Inference for Privacy-Preserving Neural Networks
Nges Brian Njungle, Eric Jahns, Michel A. Kinsy

TL;DR
This paper develops optimized algorithms and a pipeline architecture for batched homomorphic encryption neural network inference, significantly improving speed and memory efficiency for privacy-preserving applications.
Contribution
It introduces novel algorithms and a pipeline design tailored for batched HE inference, enabling high-throughput and resource-efficient encrypted neural network processing.
Findings
Achieved 1.78x faster inference time for ResNet-20 on encrypted CIFAR-10 images.
Reduced memory usage by 3.74x compared to previous methods.
Demonstrated scalable encrypted inference for deeper ResNet-34 models.
Abstract
Privacy-preserving machine learning (PPML) has become increasingly important in applications where sensitive data must remain confidential. Homomorphic Encryption (HE) enables computation directly on encrypted data, allowing neural network inference without revealing raw inputs. While prior works have largely focused on inference over a single encrypted image, batch processing of encrypted inputs lags behind, despite being critical for high-throughput inference scenarios and training-oriented workloads. In this work, we address this gap by developing optimized algorithms for batched HE-friendly neural networks. We also introduced a pipeline architecture designed to maximize resource efficiency for different batch size execution. We implemented these algorithms and evaluated our work using HE-friendly ResNet-20 and ResNet-34 models on encrypted CIFAR-10 and CIFAR-100 datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
