Fast and Private Inference of Deep Neural Networks by Co-designing   Activation Functions

Abdulrahman Diaa; Lucas Fenaux; Thomas Humphries; Marian Dietz; Faezeh; Ebrahimianghazani; Bailey Kacsmar; Xinda Li; Nils Lukas; Rasoul Akhavan; Mahdavi; Simon Oya; Ehsan Amjadian; Florian Kerschbaum

arXiv:2306.08538·cs.CR·April 17, 2024·1 cites

Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions

Abdulrahman Diaa, Lucas Fenaux, Thomas Humphries, Marian Dietz, Faezeh, Ebrahimianghazani, Bailey Kacsmar, Xinda Li, Nils Lukas, Rasoul Akhavan, Mahdavi, Simon Oya, Ehsan Amjadian, Florian Kerschbaum

PDF

Open Access 1 Repo

TL;DR

This paper introduces a co-designed polynomial activation function and a new training algorithm to enable fast, private inference of deep neural networks using multi-party computation, achieving significant speedups while maintaining accuracy.

Contribution

It proposes a novel polynomial activation function and training method that together enable efficient, privacy-preserving neural network inference with improved speed and accuracy.

Findings

01

Achieves 3 to 110 times faster inference times.

02

Maintains competitive accuracy with plaintext models.

03

Scales to large models with up to 23 million parameters.

Abstract

Machine Learning as a Service (MLaaS) is an increasingly popular design where a company with abundant computing resources trains a deep neural network and offers query access for tasks like image classification. The challenge with this design is that MLaaS requires the client to reveal their potentially sensitive queries to the company hosting the model. Multi-party computation (MPC) protects the client's data by allowing encrypted inferences. However, current approaches suffer from prohibitively large inference times. The inference time bottleneck in MPC is the evaluation of non-linear layers such as ReLU activation functions. Motivated by the success of previous work co-designing machine learning and MPC, we develop an activation function co-design. We replace all ReLUs with a polynomial approximation and evaluate them with single-round MPC protocols, which give state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lucasfenaux/pillar-espn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques

Methodstravel james