DCT-CryptoNets: Scaling Private Inference in the Frequency Domain
Arjun Roy, Kaushik Roy

TL;DR
DCT-CryptoNets leverages frequency domain transforms to significantly improve the efficiency and scalability of private inference in deep neural networks using fully homomorphic encryption, enabling faster and more practical encrypted image classification.
Contribution
It introduces a novel frequency-domain approach using DCT to reduce computational costs and improve scalability of FHE-based private inference for deep learning.
Findings
Achieves up to 5.3× latency reduction compared to prior methods.
Enables inference on ImageNet within 2.5 hours, down from 12.5 hours.
Reduces error accumulation and improves scalability with image size.
Abstract
The convergence of fully homomorphic encryption (FHE) and machine learning offers unprecedented opportunities for private inference of sensitive data. FHE enables computation directly on encrypted data, safeguarding the entire machine learning pipeline, including data and model confidentiality. However, existing FHE-based implementations for deep neural networks face significant challenges in computational cost, latency, and scalability, limiting their practical deployment. This paper introduces DCT-CryptoNets, a novel approach that operates directly in the frequency-domain to reduce the burden of computationally expensive non-linear activations and homomorphic bootstrap operations during private inference. It does so by utilizing the discrete cosine transform (DCT), commonly employed in JPEG encoding, which has inherent compatibility with remote computing services where images are…
Peer Reviews
Decision·ICLR 2025 Poster
$\bullet$ Performing TFHE-based inference in the frequency domain allows the usage of lower-resolution inputs without compromising accuracy. This approach significantly decreases both FLOPs and nonlinear operations (ReLU), while also requiring fewer bootstrapping operations. This results in substantial speedup benefits. More importantly, this makes it feasible to perform inference on larger input images, enhancing the practical applicability (such as semantic segmentation) of homomorphic encrypt
$\bullet$ The lack of sufficient algorithmic contributions and research insights makes it less suitable for the ML conference. Operating in the frequency domain for private inference benefits is not a novel concept (see [1,2]). Also, the usage of quantization-aware training for lower-frequency components is simply an engineering tweak. Thus, a more fitting venue for this work might be a cryptography-focused conference. $\bullet$ Moreover, the practicality of HE-only private inference remains
Novel DCT based insight for unstructured data yielding excellent computational advantages for high resolution images. Thorough benchmarking and attention to reproducible science. Excellent comparative analysis of this CKKS based technique relative to competing TFHE approaches. Excellent and comprehensive list of references that chronicle the current state of the art and prior advances.
I find no obvious weaknesses. I would emphasize to the reader perhaps new to the field ( in Section 3.2 on page 6) that model training is done in the plaintext domain. (even though this is most evident in Figure 3 presented on Page 14.
1. **Frequency-Domain Optimization**: DCT-CryptoNets leverage the Discrete Cosine Transform (DCT) to focus on low-frequency components of images, which enhances the model's ability to capture perceptually salient information while reducing computational complexity and improving accuracy compared to traditional RGB-based networks. 2. **Reduced Latency and Improved Scalability**: The proposed method achieves significant latency reductions (up to 5.3×) during inference, especially on large dataset
I have a question about the threat model setting here, why the model is trained locally? If the client could train the model by themselves, why does they do inference locally. Or if they want to deploy that encrypted model (key from model trainer) on the cloud for other clients as service. How does the key management should be solved?
Code & Models
Videos
Taxonomy
TopicsDigital Media Forensic Detection · Chaos-based Image/Signal Encryption · Advanced Malware Detection Techniques
MethodsBitcoin Customer Service Number +1-833-534-1729 · Discrete Cosine Transform
