Improved Impossible Tuning and Lipschitz-Adaptive Universal Online Learning with Gradient Variations
Kei Takemura, Ryuta Matsuno, Keita Sakuma

TL;DR
This paper introduces a novel online learning algorithm that adaptively handles gradient variations, function curvature, and gradient scales, achieving optimal regret bounds and resolving longstanding issues in universal online learning.
Contribution
It proposes an optimistic online mirror descent algorithm with a large initial learning rate, enabling simultaneous adaptivity to multiple problem characteristics in online learning.
Findings
Achieves state-of-the-art gradient variation bounds.
Resolves the impossible tuning issue up to log log T factors.
Develops the first universal online learning algorithm with combined adaptivity.
Abstract
A central goal in online learning is to achieve adaptivity to unknown problem characteristics, such as environmental changes captured by gradient variation (GV), function curvature (universal online learning, UOL), and gradient scales (Lipschitz adaptivity, LA). Simultaneously achieving these with optimal performance is a major challenge, partly due to limitations in algorithms for prediction with expert advice. These algorithms often serve as meta-algorithms in online ensemble frameworks, and their sub-optimality hinders overall UOL performance. Specifically, existing algorithms addressing the ``impossible tuning'' issue incur an excess factor in their regret bound compared to the lower bound. To solve this problem, we propose a novel optimistic online mirror descent algorithm with an auxiliary initial round using large learning rates. This design enables a refined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Advanced Adaptive Filtering Techniques · Distributed Sensor Networks and Detection Algorithms
