Banded Square Root Matrix Factorization for Differentially Private Model Training
Nikita P. Kalinin, Christoph Lampert

TL;DR
This paper introduces BSR, a new matrix factorization method that significantly reduces computational costs in differentially private model training, especially for large-scale problems, while maintaining high model performance.
Contribution
The paper proposes BSR, a novel matrix factorization approach that overcomes computational bottlenecks in differentially private training, with analytical solutions for stochastic gradient descent.
Findings
BSR achieves comparable model accuracy to existing methods.
BSR drastically reduces computational overhead.
Theoretical bounds on approximation quality are established.
Abstract
Current state-of-the-art methods for differentially private model training are based on matrix factorization techniques. However, these methods suffer from high computational overhead because they require numerically solving a demanding optimization problem to determine an approximately optimal factorization prior to the actual model training. In this work, we present a new matrix factorization approach, BSR, which overcomes this computational bottleneck. By exploiting properties of the standard matrix square root, BSR allows to efficiently handle also large-scale problems. For the key scenario of stochastic gradient descent with momentum and weight decay, we even derive analytical expressions for BSR that render the computational overhead negligible. We prove bounds on the approximation quality that hold both in the centralized and in the federated learning setting. Our numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Privacy-Preserving Technologies in Data · Model-Driven Software Engineering Techniques
