CryptoNAS: Private Inference on a ReLU Budget
Zahra Ghodsi, Akshaj Veldanda, Brandon Reagen, Siddharth Garg

TL;DR
CryptoNAS introduces a neural architecture search method that optimizes models for private inference by focusing on a ReLU budget, significantly improving accuracy and reducing latency in privacy-preserving ML inference.
Contribution
It proposes a novel NAS approach tailored for private inference, emphasizing ReLU operation costs to produce models optimized for privacy-preserving latency and accuracy.
Findings
CryptoNAS improves accuracy by 3.4%.
CryptoNAS achieves 2.4x latency reduction.
Models tailored for PI outperform existing solutions.
Abstract
Machine learning as a service has given raise to privacy concerns surrounding clients' data and providers' models and has catalyzed research in private inference (PI): methods to process inferences without disclosing inputs. Recently, researchers have adapted cryptographic techniques to show PI is possible, however all solutions increase inference latency beyond practical limits. This paper makes the observation that existing models are ill-suited for PI and proposes a novel NAS method, named CryptoNAS, for finding and tailoring models to the needs of PI. The key insight is that in PI operator latency cost are non-linear operations (e.g., ReLU) dominate latency, while linear layers become effectively free. We develop the idea of a ReLU budget as a proxy for inference latency and use CryptoNAS to build models that maximize accuracy within a given budget. CryptoNAS improves accuracy by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsCryptography and Data Security · Privacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting
Methods*Communicated@Fast*How Do I Communicate to Expedia?
