Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity
Xun Qian, Hanze Dong, Tong Zhang, Peter Richt\'arik

TL;DR
This paper introduces catalyst-accelerated error-compensated methods for distributed learning, achieving improved communication complexity by matching the best known dependence on compressor parameters, thus enhancing efficiency in communication compression techniques.
Contribution
The paper develops new accelerated error-compensated algorithms using catalyst acceleration, improving the dependence on compressor parameters to match non-accelerated methods.
Findings
Achieves near-optimal dependence on compressor parameters
Provides theoretical guarantees for accelerated error-compensated methods
Enhances communication efficiency in distributed learning
Abstract
Communication overhead is well known to be a key bottleneck in large scale distributed learning, and a particularly successful class of methods which help to overcome this bottleneck is based on the idea of communication compression. Some of the most practically effective gradient compressors, such as TopK, are biased, which causes convergence issues unless one employs a well designed {\em error compensation/feedback} mechanism. Error compensation is therefore a fundamental technique in the distributed learning literature. In a recent development, Qian et al (NeurIPS 2021) showed that the error-compensation mechanism can be combined with acceleration/momentum, which is another key and highly successful optimization technique. In particular, they developed the error-compensated loop-less Katyusha (ECLK) method, and proved an accelerated linear rate in the strongly convex case. However,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Neural Networks and Applications
