New Analysis and Algorithm for Learning with Drifting Distributions

Mehryar Mohri; Andres Munoz Medina

arXiv:1205.4343·cs.LG·August 28, 2012·5 cites

New Analysis and Algorithm for Learning with Drifting Distributions

Mehryar Mohri, Andres Munoz Medina

PDF

Open Access

TL;DR

This paper introduces a new theoretical framework and algorithm for learning in environments where data distributions change over time, providing tighter bounds and practical algorithms with promising experimental results.

Contribution

It offers a novel analysis using discrepancy measures, tighter learning bounds, a generalized online-to-batch conversion, and a new algorithm formulated as a quadratic program.

Findings

01

Tighter learning bounds based on discrepancy and Rademacher complexity.

02

A new algorithm formulated as a quadratic program demonstrating practical benefits.

03

Preliminary experiments show improved performance in drifting distribution scenarios.

Abstract

We present a new analysis of the problem of learning with drifting distributions in the batch setting using the notion of discrepancy. We prove learning bounds based on the Rademacher complexity of the hypothesis set and the discrepancy of distributions both for a drifting PAC scenario and a tracking scenario. Our bounds are always tighter and in some cases substantially improve upon previous ones based on the $L_{1}$ distance. We also present a generalization of the standard on-line to batch conversion to the drifting scenario in terms of the discrepancy and arbitrary convex combinations of hypotheses. We introduce a new algorithm exploiting these learning guarantees, which we show can be formulated as a simple QP. Finally, we report the results of preliminary experiments demonstrating the benefits of this algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Algorithms and Data Compression