HEQuant: Marrying Homomorphic Encryption and Quantization for Communication-Efficient Private Inference
Tianshi Xu, Meng Li, Runsheng Wang

TL;DR
HEQuant introduces low-precision quantization and novel optimization techniques to significantly reduce communication and latency in homomorphic encryption-based private neural network inference.
Contribution
It proposes a quantization-aware optimization framework with intra-coefficient packing and tiling algorithms for efficient HE-based neural network inference.
Findings
Achieves 3.5 to 23.4 times communication reduction compared to prior HE protocols.
Reduces latency by 3.0 to 9.3 times over existing HE-based methods.
Improves communication efficiency by 3.1 to 3.6 times compared to network optimization frameworks.
Abstract
Secure two-party computation with homomorphic encryption (HE) protects data privacy with a formal security guarantee but suffers from high communication overhead. While previous works, e.g., Cheetah, Iron, etc, have proposed efficient HE-based protocols for different neural network (NN) operations, they still assume high precision, e.g., fixed point 37 bit, for the NN operations and ignore NNs' native robustness against quantization error. In this paper, we propose HEQuant, which features low-precision-quantization-aware optimization for the HE-based protocols. We observe the benefit of a naive combination of quantization and HE quickly saturates as bit precision goes down. Hence, to further improve communication efficiency, we propose a series of optimizations, including an intra-coefficient packing algorithm and a quantization-aware tiling algorithm, to simultaneously reduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Data Security · Chaos-based Image/Signal Encryption · Cloud Data Security Solutions
MethodsAverage Pooling · Global Average Pooling · Convolution · Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Squeeze-and-Excitation Block · Dense Connections · Max Pooling · Kaiming Initialization
