EF21: A New, Simpler, Theoretically Better, and Practically Faster Error   Feedback

Peter Richt\'arik; Igor Sokolov; Ilyas Fatkhullin

arXiv:2106.05203·cs.LG·June 10, 2021·40 cites

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback

Peter Richt\'arik, Igor Sokolov, Ilyas Fatkhullin

PDF

Open Access 1 Video

TL;DR

EF21 introduces a simplified and theoretically sound error feedback mechanism for distributed training, outperforming previous methods in convergence speed and applicability without relying on strong assumptions.

Contribution

We propose EF21, a new error feedback method with improved theoretical guarantees and practical performance, applicable to heterogeneous data and nonconvex problems.

Findings

01

EF21 achieves an $O(1/T)$ convergence rate for nonconvex problems.

02

EF21 attains a linear convergence rate for PL functions.

03

EF21 outperforms previous EF methods in practice and theory.

Abstract

Error feedback (EF), also known as error compensation, is an immensely popular convergence stabilization mechanism in the context of distributed training of supervised machine learning models enhanced by the use of contractive communication compression mechanisms, such as Top- $k$ . First proposed by Seide et al (2014) as a heuristic, EF resisted any theoretical understanding until recently [Stich et al., 2018, Alistarh et al., 2018]. However, all existing analyses either i) apply to the single node setting only, ii) rely on very strong and often unreasonable assumptions, such global boundedness of the gradients, or iterate-dependent assumptions that cannot be checked a-priori and may not hold in practice, or iii) circumvent these issues via the introduction of additional unbiased compressors, which increase the communication cost. In this work we fix all these deficiencies by proposing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data