Error Compensated Loopless SVRG, Quartz, and SDCA for Distributed Optimization
Xun Qian, Hanze Dong, Peter Richt\'arik, Tong Zhang

TL;DR
This paper introduces three new error compensated gradient methods—EC-LSVRG, EC-Quartz, and EC-SDCA—for distributed optimization, achieving improved convergence rates and reduced communication costs through error compensation and gradient compression techniques.
Contribution
The paper proposes and analyzes three novel error compensated methods compatible with contraction compressors, providing theoretical convergence guarantees and demonstrating efficiency through experiments.
Findings
Linear convergence rates for EC-LSVRG in convex and smooth cases.
EC-Quartz and EC-SDCA achieve convergence rates comparable to EC-LSVRG.
Numerical experiments confirm the efficiency of the proposed methods.
Abstract
The communication of gradients is a key bottleneck in distributed training of large scale machine learning models. In order to reduce the communication cost, gradient compression (e.g., sparsification and quantization) and error compensation techniques are often used. In this paper, we propose and study three new efficient methods in this space: error compensated loopless SVRG method (EC-LSVRG), error compensated Quartz (EC-Quartz), and error compensated SDCA (EC-SDCA). Our method is capable of working with any contraction compressor (e.g., TopK compressor), and we perform analysis for convex optimization problems in the composite case and smooth case for EC-LSVRG. We prove linear convergence rates for both cases and show that in the smooth case the rate has a better dependence on the parameter associated with the contraction compressor. Further, we show that in the smooth case, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
