Falcon: Accelerating Homomorphically Encrypted Convolutions for   Efficient Private Mobile Network Inference

Tianshi Xu; Meng Li; Runsheng Wang; Ru Huang

arXiv:2308.13189·cs.CR·August 28, 2023

Falcon: Accelerating Homomorphically Encrypted Convolutions for Efficient Private Mobile Network Inference

Tianshi Xu, Meng Li, Runsheng Wang, Ru Huang

PDF

Open Access

TL;DR

Falcon introduces a novel packing algorithm that significantly accelerates homomorphic encryption-based private inference for mobile networks, improving latency and accuracy over previous methods.

Contribution

The paper presents Falcon, a dense packing algorithm with zero-aware greedy packing and communication-aware tiling, optimized for depthwise convolutions in HE-based 2PC frameworks.

Findings

01

Achieves over 15.6x latency reduction compared to CrypTFlow2.

02

Improves accuracy by 1.4% on CIFAR-100 and 4.2% on TinyImagenet.

03

Reduces inference overhead in HE-based private mobile network inference.

Abstract

Efficient networks, e.g., MobileNetV2, EfficientNet, etc, achieves state-of-the-art (SOTA) accuracy with lightweight computation. However, existing homomorphic encryption (HE)-based two-party computation (2PC) frameworks are not optimized for these networks and suffer from a high inference overhead. We observe the inefficiency mainly comes from the packing algorithm, which ignores the computation characteristics and the communication bottleneck of homomorphically encrypted depthwise convolutions. Therefore, in this paper, we propose Falcon, an effective dense packing algorithm for HE-based 2PC frameworks. Falcon features a zero-aware greedy packing algorithm and a communication-aware operator tiling strategy to improve the packing density for depthwise convolutions. Compared to SOTA HE-based 2PC frameworks, e.g., CrypTFlow2, Iron and Cheetah, Falcon achieves more than 15.6x, 5.1x and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCryptography and Data Security · MXene and MAX Phase Materials · Complexity and Algorithms in Graphs

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Sigmoid Activation · Pointwise Convolution · Squeeze-and-Excitation Block · Depthwise Separable Convolution · Batch Normalization · Dense Connections · Convolution · Dropout