Subspace Optimization for Efficient Federated Learning under Heterogeneous Data

Shuchen Zhu; Zhengyang Huang; Yuqi Xu; Peijin Li

arXiv:2604.25467 · cs.LG · April 29, 2026


TL;DR

This paper introduces SSF, a subspace optimization method for federated learning that reduces communication overhead while effectively handling non-IID data, achieving competitive convergence rates.

Contribution

The paper proposes a subspace optimization approach for federated learning that corrects for data heterogeneity using only projected, low-dimensional quantities, reducing the extra communication and memory overhead incurred by existing correction methods such as SCAFFOLD.

Findings

01

SSF achieves a non-asymptotic convergence rate of order $\mathcal{O}(1/T + 1/\sqrt{NKT})$ under standard smoothness and bounded-variance assumptions.

02

Experiments demonstrate favorable accuracy-efficiency trade-offs with SSF on heterogeneous data.

03

SSF outperforms existing heterogeneity-correction methods in communication efficiency.

Abstract

Federated learning increasingly operates in a large-model regime where communication, memory, and computation are all scarce. Typically, non-IID client data induce drift that degrades the stability and performance of local training. Existing remedies such as SCAFFOLD introduce heterogeneity-correction mechanisms to address this challenge, but they incur substantial extra communication and memory overhead. This paper proposes a subspace optimization method for federated learning (SSF), which performs heterogeneity-corrected optimization in a low-dimensional subspace using only projected quantities, while preserving full-dimensional control information through a backfill-style update that retains residual components whenever the active subspace changes. Under standard smoothness and bounded-variance assumptions, SSF attains a non-asymptotic rate of order…
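The abstract describes keeping full-dimensional control information while working only with projected quantities, with residual components retained when the active subspace changes. The NumPy sketch below is one possible reading of that idea, not the paper's algorithm: the function names (`random_orthonormal_basis`, `backfill_switch`), the random choice of subspace, and the exact split into a low-dimensional part plus a residual are all assumptions made for illustration.

```python
import numpy as np


def random_orthonormal_basis(d, r, rng):
    """Return a d x r matrix with orthonormal columns.

    Hypothetical subspace choice; the source does not say how SSF picks it.
    """
    q, _ = np.linalg.qr(rng.standard_normal((d, r)))
    return q


def backfill_switch(P_old, P_new, c_low, residual):
    """Move a control vector from subspace P_old to P_new without losing mass.

    The full-dimensional vector is residual + P_old @ c_low. It is re-split
    into a new projected part and a new residual orthogonal to P_new.
    (Assumed interpretation of the "backfill-style update".)
    """
    full = residual + P_old @ c_low          # rebuild full-dimensional control
    c_low_new = P_new.T @ full               # part visible in the new subspace
    residual_new = full - P_new @ c_low_new  # part retained outside it
    return c_low_new, residual_new


rng = np.random.default_rng(0)
d, r = 50, 5                                 # model dimension, subspace rank
P_old = random_orthonormal_basis(d, r, rng)
P_new = random_orthonormal_basis(d, r, rng)

control = rng.standard_normal(d)             # stand-in for a control variate
c_low = P_old.T @ control                    # r numbers instead of d
residual = control - P_old @ c_low

c_low_new, residual_new = backfill_switch(P_old, P_new, c_low, residual)
reconstructed = residual_new + P_new @ c_low_new
```

In this sketch only the `r`-dimensional vector would need to be exchanged in a round, while the residual stays wherever the control state is stored, so a subspace change does not discard any component of the full-dimensional control vector.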
