Enabling Homomorphically Encrypted Inference for Large DNN Models

Guillermo Lloret-Talavera; Marc Jorda; Harald Servat; Fabian Boemer,; Chetan Chauhan; Shigeki Tomishima; Nilesh N. Shah; Antonio J. Pe\~na

arXiv:2103.16139·cs.CR·April 30, 2021

Enabling Homomorphically Encrypted Inference for Large DNN Models

Guillermo Lloret-Talavera, Marc Jorda, Harald Servat, Fabian Boemer,, Chetan Chauhan, Shigeki Tomishima, Nilesh N. Shah, Antonio J. Pe\~na

PDF

TL;DR

This paper demonstrates that using hybrid DRAM and persistent memory systems, specifically Intel Optane PMem, makes homomorphically encrypted inference of large DNNs feasible and efficient, overcoming previous resource limitations.

Contribution

It introduces a novel approach leveraging hybrid memory systems and HE-Transformer nGraph to enable large DNN inference with homomorphic encryption, which was previously impractical due to resource constraints.

Findings

01

Homomorphic encryption inference benefits from hybrid memory access patterns.

02

Large models like MobileNetV2 and ResNet-50 can be run efficiently with HE on new hardware.

03

Analysis shows improved performance with specific hardware and software configurations.

Abstract

The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime overheads. Secure deep neural network (DNN) inference using HE is currently limited by computing and memory resources, with frameworks requiring hundreds of gigabytes of DRAM to evaluate small models. To overcome these limitations, in this paper we explore the feasibility of leveraging hybrid memory systems comprised of DRAM and persistent memory. In particular, we explore the recently-released Intel Optane PMem technology and the Intel HE-Transformer nGraph to run large neural networks such as MobileNetV2 (in its largest variant) and ResNet-50 for the first time in the literature. We present an in-depth analysis of the efficiency of the executions with different hardware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPointwise Convolution · Batch Normalization · Average Pooling · Depthwise Convolution · Depthwise Separable Convolution · Inverted Residual Block · 1x1 Convolution · Convolution · Tether Customer Service Number +1-833-534-1729