Adaptive Online Learning in Dynamic Environments

Lijun Zhang; Shiyin Lu; Zhi-Hua Zhou

arXiv:1810.10815·cs.LG·October 26, 2018·52 cites

TL;DR

This paper introduces Ader, an adaptive online learning algorithm that achieves the optimal dynamic regret bound in changing environments, needs fewer gradient evaluations in its improved form, and extends to settings where a sequence of dynamical models is available.

Contribution

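The paper proposes Ader, a novel adaptive online learning method that attains the optimal dynamic regret bound. An improved version reduces the number of gradient evaluations per round from $O(\log T)$ to 1, and the method also extends to the case where a sequence of dynamical models is available.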

Findings

01

Ader achieves the optimal $O(\sqrt{T(1+P_T)})$ dynamic regret.

02

The improved Ader reduces gradient evaluations per round from $O(\log T)$ to 1.

03

Ader can incorporate sequences of dynamical models for better adaptation.
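
For readers unfamiliar with the quantities in these findings, the two central definitions can be written out as follows. The symbols $f_t$ (loss at round $t$), $x_t$ (the learner's decision), and $u_t$ (the comparator at round $t$) are notation assumed here for illustration and do not appear in the summary itself.

```latex
% Dynamic regret against an arbitrary comparator sequence u_1, ..., u_T
R(u_1, \dots, u_T) = \sum_{t=1}^{T} f_t(x_t) - \sum_{t=1}^{T} f_t(u_t)

% Path-length of the comparator sequence
P_T = \sum_{t=2}^{T} \lVert u_t - u_{t-1} \rVert_2
```

When the comparator is fixed, $P_T = 0$ and the $O(\sqrt{T(1+P_T)})$ bound reduces to the familiar $O(\sqrt{T})$ static regret rate.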

Abstract

In this paper, we study online convex optimization in dynamic environments, and aim to bound the dynamic regret with respect to any sequence of comparators. Existing work has shown that online gradient descent enjoys an $O(\sqrt{T}(1 + P_{T}))$ dynamic regret, where $T$ is the number of iterations and $P_{T}$ is the path-length of the comparator sequence. However, this result is unsatisfactory, as there exists a large gap from the $\Omega(\sqrt{T(1 + P_{T})})$ lower bound established in our paper. To address this limitation, we develop a novel online method, namely adaptive learning for dynamic environment (Ader), which achieves an optimal $O(\sqrt{T(1 + P_{T})})$ dynamic regret. The basic idea is to maintain a set of experts, each attaining an optimal dynamic regret for a specific path-length, and combine them with an expert-tracking algorithm. Furthermore, we propose an improved Ader based on the…
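
The expert-plus-meta-algorithm idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `ader_sketch`, the geometric step-size grid, the uniform initial weights, and the meta learning rate `alpha` are all assumptions chosen for simplicity, and the domain is assumed to be a Euclidean ball. Each expert runs projected online gradient descent with its own step size, and an exponential-weights rule combines them. To reflect the single-gradient finding, all experts reuse one gradient per round through a linear surrogate loss, which is one plausible reading of how that reduction could work.

```python
import math


def project_ball(x, radius):
    """Euclidean projection of x onto the ball of the given radius."""
    norm = math.sqrt(sum(v * v for v in x))
    if norm <= radius:
        return list(x)
    return [v * radius / norm for v in x]


def ader_sketch(grad_fn, T, dim, radius=1.0, grad_bound=1.0):
    """Simplified expert-combination learner run for T rounds.

    grad_fn(t, x) returns the gradient of the round-t loss at x.
    Returns the list of decisions x_1, ..., x_T.
    """
    D = 2.0 * radius  # diameter of the feasible ball
    # Assumed geometric grid of step sizes, one per expert.
    n_experts = math.ceil(0.5 * math.log2(1 + T)) + 1
    etas = [(2 ** i) * D / (grad_bound * math.sqrt(T)) for i in range(n_experts)]
    experts = [[0.0] * dim for _ in etas]      # every expert starts at the origin
    weights = [1.0 / n_experts] * n_experts    # assumed uniform prior
    # Assumed meta learning rate; surrogate losses lie in a range of width 2*G*D.
    alpha = math.sqrt(8.0 / T) / (2.0 * grad_bound * D)

    decisions = []
    for t in range(T):
        # Play the weighted average of the experts' current points.
        x = [sum(w * e[j] for w, e in zip(weights, experts)) for j in range(dim)]
        decisions.append(x)
        g = grad_fn(t, x)  # the only gradient evaluation in this round

        # Linear surrogate loss of each expert: <g, e - x>.
        losses = [sum(gj * (ej - xj) for gj, ej, xj in zip(g, e, x)) for e in experts]
        # Exponential-weights update (shifted by the minimum for stability).
        low = min(losses)
        scaled = [w * math.exp(-alpha * (l - low)) for w, l in zip(weights, losses)]
        total = sum(scaled)
        weights = [s / total for s in scaled]

        # Each expert takes a projected gradient step using the shared gradient.
        experts = [
            project_ball([ej - eta * gj for ej, gj in zip(e, g)], radius)
            for e, eta in zip(experts, etas)
        ]
    return decisions
```

A caller would supply `grad_fn` for its own loss sequence, for example the gradient of a squared distance to a slowly drifting target. Experts with small step sizes suit nearly stationary comparators, experts with large step sizes suit fast-moving ones, and the meta-algorithm shifts weight toward whichever is doing best.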

Taxonomy

Topics: Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Online Learning and Analytics