Parallel Online Learning

Daniel Hsu; Nikos Karampatziakis; John Langford; Alex Smola

arXiv:1103.4204·cs.LG·March 23, 2011

Parallel Online Learning

Daniel Hsu, Nikos Karampatziakis, John Langford, Alex Smola

PDF

Open Access

TL;DR

This paper investigates parallel online learning, analyzing the impact of delayed updates caused by parallelization, and explores feature sharding architectures to balance delay, parallelism, and learning performance.

Contribution

It introduces a feature sharding approach for parallel online learning and analyzes tradeoffs between delay, parallelism, and empirical effectiveness.

Findings

01

Delayed updates can significantly impair learning performance.

02

Feature sharding offers a tradeoff between delay and empirical accuracy.

03

Preliminary empirical results demonstrate potential benefits of the proposed architectures.

Abstract

In this work we study parallelization of online learning, a core primitive in machine learning. In a parallel environment all known approaches for parallel online learning lead to delayed updates, where the model is updated using out-of-date information. In the worst case, or when examples are temporally correlated, delay can have a very adverse effect on the learning algorithm. Here, we analyze and present preliminary empirical results on a set of learning architectures based on a feature sharding approach that present various tradeoffs between delay, degree of parallelism, representation power and empirical performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Online Learning and Analytics · Advanced Bandit Algorithms Research