Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games
Constantinos Daskalakis, Noah Golowich

TL;DR
This paper develops new algorithms for nonparametric online learning that achieve near-optimal convergence rates and applies these results to learning in games, improving upon previous bounds.
Contribution
It introduces a randomized proper learning algorithm with near-optimal loss bounds and applies it to general-sum binary games, achieving improved regret bounds.
Findings
Proper learners achieve near-optimal cumulative loss in nonparametric online regression.
New regret bounds of (d^{3/4} \, T^{1/4}) for players in general-sum binary games.
Hierarchical aggregation and stability techniques are introduced for nonparametric online learning.
Abstract
We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the realizable setting of nonparametric online regression with the absolute loss, we propose a randomized proper learning algorithm which gets a near-optimal cumulative loss in terms of the sequential fat-shattering dimension of the hypothesis class. In the setting of online classification with a class of Littlestone dimension , our bound reduces to . This result answers a question as to whether proper learners could achieve near-optimal cumulative loss; previously, even for online classification, the best known cumulative loss was . Further, for the real-valued (regression) setting, a cumulative loss bound…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
