Scaling up the Banded Matrix Factorization Mechanism for Differentially Private ML
Ryan McKenna

TL;DR
This paper introduces scalable techniques for the DP-BandMF mechanism, enabling it to handle large-scale models and training iterations in differentially private machine learning without significant utility loss.
Contribution
We develop methods to significantly scale up DP-BandMF, overcoming previous limitations in handling large models and many training iterations.
Findings
Enabled DP-BandMF to handle models with over 10^7 parameters.
Supported training with over 10^4 iterations.
Maintained utility with negligible degradation.
Abstract
Correlated noise mechanisms such as DP Matrix Factorization (DP-MF) have proven to be effective alternatives to DP-SGD in large-epsilon few-epoch training regimes. Significant work has been done to find the best correlated noise strategies, and the current state-of-the-art approach is DP-BandMF, which optimally balances the benefits of privacy amplification and noise correlation. Despite it's utility advantages, severe scalability limitations prevent this mechanism from handling large-scale training scenarios where the number of training iterations may exceed and the number of model parameters may exceed . In this work, we present techniques to scale up DP-BandMF along these two dimensions, significantly extending it's reach and enabling it to handle settings with virtually any number of model parameters and training iterations, with negligible utility degradation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCooperative Communication and Network Coding
