Beyond Square Roots: Explicit Memory-Efficient Factorization for Multi-Epoch Private Learning
Nikita P. Kalinin, Aki Rehn, Joel Daniel Andersson, Antti Honkela, Christoph H. Lampert

TL;DR
This paper introduces $\gamma$-BIFR, a new memory-efficient factorization method that improves private learning utility and guarantees across different bandwidth regimes.
Contribution
It unifies existing banded inverse factorizations into a single framework, enhancing performance in low-memory, multi-epoch private training.
Findings
$\gamma$-BIFR improves RMSE and private training performance in low-memory regimes.
Provides tighter theoretical guarantees for multi-participation error.
Outperforms existing factorizations at low bandwidths.
Abstract
Correlated-noise mechanisms are among the most promising approaches for improving the utility of differentially private model training, but rigorous guarantees require explicit, analyzable factorizations, and practical deployment requires memory efficiency. Recent works have developed banded inverse factorizations, which address both requirements by exploiting a banded structure in the correlation matrix. The bandwidth controls the size of the noise buffer used to correlate noise across iterations, and thus governs the tradeoff between utility and memory cost. Existing factorizations highlight this tradeoff: DP-CGD achieves high memory efficiency by using only a one-step noise buffer, but this limits its utility gains, while the banded inverse square root (BISR) factorization exploits larger correlation windows and is asymptotically optimal for large bandwidths but performs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
