DCT-CryptoNets: Scaling Private Inference in the Frequency Domain

Arjun Roy; Kaushik Roy

arXiv:2408.15231·cs.CR·January 24, 2025

DCT-CryptoNets: Scaling Private Inference in the Frequency Domain

Arjun Roy, Kaushik Roy

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

DCT-CryptoNets leverages frequency domain transforms to significantly improve the efficiency and scalability of private inference in deep neural networks using fully homomorphic encryption, enabling faster and more practical encrypted image classification.

Contribution

It introduces a novel frequency-domain approach using DCT to reduce computational costs and improve scalability of FHE-based private inference for deep learning.

Findings

01

Achieves up to 5.3× latency reduction compared to prior methods.

02

Enables inference on ImageNet within 2.5 hours, down from 12.5 hours.

03

Reduces error accumulation and improves scalability with image size.

Abstract

The convergence of fully homomorphic encryption (FHE) and machine learning offers unprecedented opportunities for private inference of sensitive data. FHE enables computation directly on encrypted data, safeguarding the entire machine learning pipeline, including data and model confidentiality. However, existing FHE-based implementations for deep neural networks face significant challenges in computational cost, latency, and scalability, limiting their practical deployment. This paper introduces DCT-CryptoNets, a novel approach that operates directly in the frequency-domain to reduce the burden of computationally expensive non-linear activations and homomorphic bootstrap operations during private inference. It does so by utilizing the discrete cosine transform (DCT), commonly employed in JPEG encoding, which has inherent compatibility with remote computing services where images are…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 5Confidence 4

Strengths

$\bullet$ Performing TFHE-based inference in the frequency domain allows the usage of lower-resolution inputs without compromising accuracy. This approach significantly decreases both FLOPs and nonlinear operations (ReLU), while also requiring fewer bootstrapping operations. This results in substantial speedup benefits. More importantly, this makes it feasible to perform inference on larger input images, enhancing the practical applicability (such as semantic segmentation) of homomorphic encrypt

Weaknesses

$\bullet$ The lack of sufficient algorithmic contributions and research insights makes it less suitable for the ML conference. Operating in the frequency domain for private inference benefits is not a novel concept (see [1,2]). Also, the usage of quantization-aware training for lower-frequency components is simply an engineering tweak. Thus, a more fitting venue for this work might be a cryptography-focused conference. $\bullet$ Moreover, the practicality of HE-only private inference remains

Reviewer 02Rating 8Confidence 4

Strengths

Novel DCT based insight for unstructured data yielding excellent computational advantages for high resolution images. Thorough benchmarking and attention to reproducible science. Excellent comparative analysis of this CKKS based technique relative to competing TFHE approaches. Excellent and comprehensive list of references that chronicle the current state of the art and prior advances.

Weaknesses

I find no obvious weaknesses. I would emphasize to the reader perhaps new to the field ( in Section 3.2 on page 6) that model training is done in the plaintext domain. (even though this is most evident in Figure 3 presented on Page 14.

Reviewer 03Rating 6Confidence 3

Strengths

1. **Frequency-Domain Optimization**: DCT-CryptoNets leverage the Discrete Cosine Transform (DCT) to focus on low-frequency components of images, which enhances the model's ability to capture perceptually salient information while reducing computational complexity and improving accuracy compared to traditional RGB-based networks. 2. **Reduced Latency and Improved Scalability**: The proposed method achieves significant latency reductions (up to 5.3×) during inference, especially on large dataset

Weaknesses

I have a question about the threat model setting here, why the model is trained locally? If the client could train the model by themselves, why does they do inference locally. Or if they want to deploy that encrypted model (key from model trainer) on the cloud for other clients as service. How does the key management should be solved?

Code & Models

Repositories

ar-roy/dct-cryptonets
pytorchOfficial

Videos

DCT-CryptoNets: Scaling Private Inference in the Frequency Domain· slideslive

Taxonomy

TopicsDigital Media Forensic Detection · Chaos-based Image/Signal Encryption · Advanced Malware Detection Techniques

MethodsBitcoin Customer Service Number +1-833-534-1729 · Discrete Cosine Transform