Distributed Learning, Communication Complexity and Privacy

Maria-Florina Balcan; Avrim Blum; Shai Fine; and Yishay Mansour

arXiv:1204.3514·cs.LG·May 28, 2012

Distributed Learning, Communication Complexity and Privacy

Maria-Florina Balcan, Avrim Blum, Shai Fine, and Yishay Mansour

PDF

1 Video

TL;DR

This paper investigates the communication complexity of PAC-learning from distributed data, providing bounds, algorithms, and privacy considerations, with applications to various concept classes and learning settings.

Contribution

It introduces new bounds and algorithms for distributed learning, incorporating concepts like teaching-dimension and mistake-bound, and explores privacy aspects in this context.

Findings

01

Tight bounds for common concept classes like conjunctions and parity functions.

02

Efficient distributed Perceptron algorithm for linear separators under certain distributions.

03

A generic boosting approach achieving logarithmic communication dependence on 1/epsilon.

Abstract

We consider the problem of PAC-learning from distributed data and analyze fundamental communication complexity questions involved. We provide general upper and lower bounds on the amount of communication needed to learn well, showing that in addition to VC-dimension and covering number, quantities such as the teaching-dimension and mistake-bound of a class play an important role. We also present tight results for a number of common concept classes including conjunctions, parity functions, and decision lists. For linear separators, we show that for non-concentrated distributions, we can use a version of the Perceptron algorithm to learn with much less communication than the number of updates given by the usual margin bound. We also show how boosting can be performed in a generic manner in the distributed setting to achieve communication with only logarithmic dependence on 1/epsilon for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Distributed Learning, Communication Complexity, and Privacy· youtube