Towards Fast and Scalable Private Inference
Jianqiao Mo, Karthik Garimella, Negar Neda, Austin Ebel, Brandon, Reagen

TL;DR
This paper reviews recent advances in privacy-preserving computation for neural network inference, focusing on reducing computational and communication overheads through specialized accelerators like HAAC and RPU.
Contribution
It introduces a comprehensive overview of PPC technologies, characterizes their overheads in private inference, and presents two accelerators that improve efficiency.
Findings
HAAC accelerates garbled circuits for private inference.
RPU accelerates homomorphic encryption computations.
Future work needed to further reduce overheads.
Abstract
Privacy and security have rapidly emerged as first order design constraints. Users now demand more protection over who can see their data (confidentiality) as well as how it is used (control). Here, existing cryptographic techniques for security fall short: they secure data when stored or communicated but must decrypt it for computation. Fortunately, a new paradigm of computing exists, which we refer to as privacy-preserving computation (PPC). Emerging PPC technologies can be leveraged for secure outsourced computation or to enable two parties to compute without revealing either users' secret data. Despite their phenomenal potential to revolutionize user protection in the digital age, the realization has been limited due to exorbitant computational, communication, and storage overheads. This paper reviews recent efforts on addressing various PPC overheads using private inference (PI)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
