Fast Rates for Nonparametric Online Learning: From Realizability to   Learning in Games

Constantinos Daskalakis; Noah Golowich

arXiv:2111.08911·cs.LG·April 13, 2022

Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games

Constantinos Daskalakis, Noah Golowich

PDF

Open Access

TL;DR

This paper develops new algorithms for nonparametric online learning that achieve near-optimal convergence rates and applies these results to learning in games, improving upon previous bounds.

Contribution

It introduces a randomized proper learning algorithm with near-optimal loss bounds and applies it to general-sum binary games, achieving improved regret bounds.

Findings

01

Proper learners achieve near-optimal cumulative loss in nonparametric online regression.

02

New regret bounds of (d^{3/4} \, T^{1/4}) for players in general-sum binary games.

03

Hierarchical aggregation and stability techniques are introduced for nonparametric online learning.

Abstract

We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the realizable setting of nonparametric online regression with the absolute loss, we propose a randomized proper learning algorithm which gets a near-optimal cumulative loss in terms of the sequential fat-shattering dimension of the hypothesis class. In the setting of online classification with a class of Littlestone dimension $d$ , our bound reduces to $d \cdot poly lo g T$ . This result answers a question as to whether proper learners could achieve near-optimal cumulative loss; previously, even for online classification, the best known cumulative loss was $\tilde{O} (d T)$ . Further, for the real-valued (regression) setting, a cumulative loss bound…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Data Stream Mining Techniques