Universal Online Learning with Unbounded Losses: Memory Is All You Need
Moise Blanchard, Romain Cosson, Steve Hanneke

TL;DR
This paper proves that for unbounded losses in online learning, having a finite set of possible data points is necessary and sufficient for universal learning, and shows memorization as a simple universal strategy.
Contribution
It resolves Hanneke's open problem by characterizing the processes allowing universal learning with unbounded losses and introduces a simple memorization rule as universally optimal.
Findings
Universal learning is possible iff data has finitely many values.
Memorization rule suffices for universal learning in this setting.
Results extend to non-realizable scenarios with Bayes consistency.
Abstract
We resolve an open problem of Hanneke on the subject of universally consistent online learning with non-i.i.d. processes and unbounded losses. The notion of an optimistically universal learning rule was defined by Hanneke in an effort to study learning theory under minimal assumptions. A given learning rule is said to be optimistically universal if it achieves a low long-run average loss whenever the data generating process makes this goal achievable by some learning rule. Hanneke posed as an open problem whether, for every unbounded loss, the family of processes admitting universal learning are precisely those having a finite number of distinct values almost surely. In this paper, we completely resolve this problem, showing that this is indeed the case. As a consequence, this also offers a dramatically simpler formulation of an optimistically universal learning rule for any unbounded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research
