Beyond Square Roots: Explicit Memory-Efficient Factorization for Multi-Epoch Private Learning

Nikita P. Kalinin; Aki Rehn; Joel Daniel Andersson; Antti Honkela; Christoph H. Lampert

arXiv:2605.18379·cs.LG·May 19, 2026

Beyond Square Roots: Explicit Memory-Efficient Factorization for Multi-Epoch Private Learning

Nikita P. Kalinin, Aki Rehn, Joel Daniel Andersson, Antti Honkela, Christoph H. Lampert

PDF

TL;DR

This paper introduces $\gamma$-BIFR, a new memory-efficient factorization method that improves private learning utility and guarantees across different bandwidth regimes.

Contribution

It unifies existing banded inverse factorizations into a single framework, enhancing performance in low-memory, multi-epoch private training.

Findings

01

$\gamma$-BIFR improves RMSE and private training performance in low-memory regimes.

02

Provides tighter theoretical guarantees for multi-participation error.

03

Outperforms existing factorizations at low bandwidths.

Abstract

Correlated-noise mechanisms are among the most promising approaches for improving the utility of differentially private model training, but rigorous guarantees require explicit, analyzable factorizations, and practical deployment requires memory efficiency. Recent works have developed banded inverse factorizations, which address both requirements by exploiting a banded structure in the correlation matrix. The bandwidth controls the size of the noise buffer used to correlate noise across iterations, and thus governs the tradeoff between utility and memory cost. Existing factorizations highlight this tradeoff: DP- $λ$ CGD achieves high memory efficiency by using only a one-step noise buffer, but this limits its utility gains, while the banded inverse square root (BISR) factorization exploits larger correlation windows and is asymptotically optimal for large bandwidths but performs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.