Gradient Projection onto Historical Descent Directions for Communication-Efficient Federated Learning

Arnaud Descours (UCBL); L\'eonard Deroose; Jan Ramon

arXiv:2511.05593·cs.LG·November 11, 2025

Gradient Projection onto Historical Descent Directions for Communication-Efficient Federated Learning

Arnaud Descours (UCBL), L\'eonard Deroose, Jan Ramon

PDF

Open Access

TL;DR

This paper introduces two algorithms, ProjFL and ProjFL+EF, that improve communication efficiency in federated learning by projecting gradients onto historical descent directions, with proven convergence and comparable accuracy to existing methods.

Contribution

The paper proposes novel gradient projection algorithms for federated learning that reduce communication costs while maintaining convergence guarantees across various convexity settings.

Findings

01

Achieve similar accuracy to baseline methods

02

Significantly reduce communication overhead

03

Convergence guarantees under multiple convexity conditions

Abstract

Federated Learning (FL) enables decentralized model training across multiple clients while optionally preserving data privacy. However, communication efficiency remains a critical bottleneck, particularly for large-scale models. In this work, we introduce two complementary algorithms: ProjFL, designed for unbiased compressors, and ProjFL+EF, tailored for biased compressors through an Error Feedback mechanism. Both methods rely on projecting local gradients onto a shared client-server subspace spanned by historical descent directions, enabling efficient information exchange with minimal communication overhead. We establish convergence guarantees for both algorithms under strongly convex, convex, and non-convex settings. Empirical evaluations on standard FL classification benchmarks with deep neural networks show that ProjFL and ProjFL+EF achieve accuracy comparable to existing baselines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning