PrivCirNet: Efficient Private Inference via Block Circulant Transformation
Tianshi Xu, Lemeng Wu, Runsheng Wang, Meng Li

TL;DR
PrivCirNet introduces a block circulant transformation to optimize homomorphic encryption-based neural network inference, significantly reducing computation latency and improving accuracy compared to existing methods.
Contribution
It proposes a novel protocol and network co-optimization framework using block circulant matrices, with a customized HE encoding and layer-wise block size search for efficient private inference.
Findings
Reduces latency by 5.0x on ResNet-18 with iso-accuracy.
Achieves 1.3x latency reduction on Vision Transformer.
Improves accuracy by up to 12% over prior methods.
Abstract
Homomorphic encryption (HE)-based deep neural network (DNN) inference protects data and model privacy but suffers from significant computation overhead. We observe transforming the DNN weights into circulant matrices converts general matrix-vector multiplications into HE-friendly 1-dimensional convolutions, drastically reducing the HE computation cost. Hence, in this paper, we propose \method, a protocol/network co-optimization framework based on block circulant transformation. At the protocol level, PrivCirNet customizes the HE encoding algorithm that is fully compatible with the block circulant transformation and reduces the computation latency in proportion to the block size. At the network level, we propose a latency-aware formulation to search for the layer-wise block size assignment based on second-order information. PrivCirNet also leverages layer fusion to further reduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Face recognition and analysis
MethodsAttention Is All You Need · Batch Normalization · Pointwise Convolution · Linear Layer · Position-Wise Feed-Forward Layer · Convolution · 1x1 Convolution · Depthwise Convolution · Depthwise Separable Convolution · Inverted Residual Block
