Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning
Christopher A. Choquette-Choo, H. Brendan McMahan, Keith Rush, and, Abhradeep Thakurta

TL;DR
This paper presents advanced differentially private matrix factorization techniques for gradient-based machine learning, significantly enhancing privacy-utility tradeoffs over multiple epochs with efficient algorithms and broad applicability.
Contribution
Introduces novel DP mechanisms for multi-epoch gradient-based ML, extending online matrix factorization to adaptive streams, with efficient Fourier-transform-based methods and broad linear query applicability.
Findings
Significant privacy-utility improvements over DP-SGD.
Efficient Fourier-transform-based mechanism with minimal utility loss.
Effective for large-scale ML tasks like image classification and language modeling.
Abstract
We introduce new differentially private (DP) mechanisms for gradient-based machine learning (ML) with multiple passes (epochs) over a dataset, substantially improving the achievable privacy-utility-computation tradeoffs. We formalize the problem of DP mechanisms for adaptive streams with multiple participations and introduce a non-trivial extension of online matrix factorization DP mechanisms to our setting. This includes establishing the necessary theory for sensitivity calculations and efficient computation of optimal matrices. For some applications like SGD steps, applying these optimal techniques becomes computationally expensive. We thus design an efficient Fourier-transform-based mechanism with only a minor utility loss. Extensive empirical evaluation on both example-level DP for image classification and user-level DP for language modeling demonstrate substantial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Error Correcting Code Techniques
MethodsStochastic Gradient Descent
