Adaptive Online Learning in Dynamic Environments
Lijun Zhang, Shiyin Lu, Zhi-Hua Zhou

TL;DR
This paper introduces Ader, an adaptive online learning algorithm that achieves optimal dynamic regret bounds in changing environments, improving efficiency and extending to models with available dynamical sequences.
Contribution
The paper proposes Ader, a novel adaptive online learning method that attains the optimal dynamic regret bound and reduces gradient evaluations, also extending to dynamical models.
Findings
Ader achieves the optimal $O( ext{sqrt}(T(1+P_T)))$ dynamic regret.
The improved Ader reduces gradient evaluations from $O( ext{log} T)$ to 1.
Ader can incorporate sequences of dynamical models for better adaptation.
Abstract
In this paper, we study online convex optimization in dynamic environments, and aim to bound the dynamic regret with respect to any sequence of comparators. Existing work have shown that online gradient descent enjoys an dynamic regret, where is the number of iterations and is the path-length of the comparator sequence. However, this result is unsatisfactory, as there exists a large gap from the lower bound established in our paper. To address this limitation, we develop a novel online method, namely adaptive learning for dynamic environment (Ader), which achieves an optimal dynamic regret. The basic idea is to maintain a set of experts, each attaining an optimal dynamic regret for a specific path-length, and combines them with an expert-tracking algorithm. Furthermore, we propose an improved Ader based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Online Learning and Analytics
