Communication-efficient Algorithm for Distributed Sparse Learning via Two-way Truncation
Jineng Ren, Jarvis Haupt

TL;DR
This paper introduces a communication-efficient distributed algorithm for high-dimensional sparse learning that significantly reduces communication costs while maintaining estimation accuracy comparable to centralized methods.
Contribution
The paper presents a novel Two-way Truncation procedure that reduces communication costs from dependence on dimension to dependence on sparsity, with proven exponential error decay.
Findings
Reduces communication from dimension-based to sparsity-based costs
Estimates converge exponentially to the true parameters
Performs comparably to centralized methods in experiments
Abstract
We propose a communicationally and computationally efficient algorithm for high-dimensional distributed sparse learning. At each iteration, local machines compute the gradient on local data and the master machine solves one shifted regularized minimization problem. The communication cost is reduced from constant times of the dimension number for the state-of-the-art algorithm to constant times of the sparsity number via Two-way Truncation procedure. Theoretically, we prove that the estimation error of the proposed algorithm decreases exponentially and matches that of the centralized method under mild assumptions. Extensive experiments on both simulated data and real data verify that the proposed algorithm is efficient and has performance comparable with the centralized method on solving high-dimensional sparse learning problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Face and Expression Recognition · Sparse and Compressive Sensing Techniques
