Efficient Distributed Learning with Sparsity

Jialei Wang; Mladen Kolar; Nathan Srebro; Tong Zhang

arXiv:1605.07991·stat.ML·May 26, 2016·42 cites

Efficient Distributed Learning with Sparsity

Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang

PDF

Open Access

TL;DR

This paper introduces an efficient distributed sparse learning method that reduces communication costs and matches centralized estimation accuracy, suitable for high-dimensional regression and classification tasks.

Contribution

It presents a novel distributed approach that requires minimal communication and achieves estimation error bounds comparable to centralized methods.

Findings

01

Matches centralized estimation error bounds within constant communication rounds

02

Demonstrates strong performance on high-dimensional regression tasks

03

Effective on both simulated and real-world datasets

Abstract

We propose a novel, efficient approach for distributed sparse learning in high-dimensions, where observations are randomly partitioned across machines. Computationally, at each round our method only requires the master machine to solve a shifted ell_1 regularized M-estimation problem, and other workers to compute the gradient. In respect of communication, the proposed approach provably matches the estimation error bound of centralized methods within constant rounds of communications (ignoring logarithmic factors). We conduct extensive experiments on both simulated and real world datasets, and demonstrate encouraging performances on high-dimensional regression and classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Distributed Sensor Networks and Detection Algorithms