PrivCirNet: Efficient Private Inference via Block Circulant   Transformation

Tianshi Xu; Lemeng Wu; Runsheng Wang; Meng Li

arXiv:2405.14569·cs.CR·October 30, 2024

PrivCirNet: Efficient Private Inference via Block Circulant Transformation

Tianshi Xu, Lemeng Wu, Runsheng Wang, Meng Li

PDF

Open Access 1 Repo 1 Video

TL;DR

PrivCirNet introduces a block circulant transformation to optimize homomorphic encryption-based neural network inference, significantly reducing computation latency and improving accuracy compared to existing methods.

Contribution

It proposes a novel protocol and network co-optimization framework using block circulant matrices, with a customized HE encoding and layer-wise block size search for efficient private inference.

Findings

01

Reduces latency by 5.0x on ResNet-18 with iso-accuracy.

02

Achieves 1.3x latency reduction on Vision Transformer.

03

Improves accuracy by up to 12% over prior methods.

Abstract

Homomorphic encryption (HE)-based deep neural network (DNN) inference protects data and model privacy but suffers from significant computation overhead. We observe transforming the DNN weights into circulant matrices converts general matrix-vector multiplications into HE-friendly 1-dimensional convolutions, drastically reducing the HE computation cost. Hence, in this paper, we propose \method, a protocol/network co-optimization framework based on block circulant transformation. At the protocol level, PrivCirNet customizes the HE encoding algorithm that is fully compatible with the block circulant transformation and reduces the computation latency in proportion to the block size. At the network level, we propose a latency-aware formulation to search for the layer-wise block size assignment based on second-order information. PrivCirNet also leverages layer fusion to further reduce the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tianshi-xu/privcirnet
pytorchOfficial

Videos

PrivCirNet: Efficient Private Inference via Block Circulant Transformation· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Face recognition and analysis

MethodsAttention Is All You Need · Batch Normalization · Pointwise Convolution · Linear Layer · Position-Wise Feed-Forward Layer · Convolution · 1x1 Convolution · Depthwise Convolution · Depthwise Separable Convolution · Inverted Residual Block