Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression
Xiaoyun Li, Ping Li

TL;DR
This paper analyzes the impact of biased gradient compression with error feedback in federated non-convex optimization, revealing convergence properties and limitations under partial client participation.
Contribution
It provides a comprehensive theoretical analysis of error feedback in federated learning with biased compression, including new insights under partial participation scenarios.
Findings
Fed-EF matches full-precision convergence rates with linear speedup.
Partial participation introduces a slow-down due to stale error accumulation.
Two-way compression does not affect convergence results.
Abstract
In federated learning (FL) systems, e.g., wireless networks, the communication cost between the clients and the central server can often be a bottleneck. To reduce the communication cost, the paradigm of communication compression has become a popular strategy in the literature. In this paper, we focus on biased gradient compression techniques in non-convex FL problems. In the classical setting of distributed learning, the method of error feedback (EF) is a common technique to remedy the downsides of biased gradient compression. In this work, we study a compressed FL scheme equipped with error feedback, named Fed-EF. We further propose two variants: Fed-EF-SGD and Fed-EF-AMS, depending on the choice of the global model optimizer. We provide a generic theoretical analysis, which shows that directly applying biased compression in FL leads to a non-vanishing bias in the convergence rate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBone and Joint Diseases · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
