Algorithm Design for Online Meta-Learning with Task Boundary Detection
Daouda Sow, Sen Lin, Yingbin Liang, Junshan Zhang

TL;DR
This paper introduces a novel task-agnostic online meta-learning algorithm capable of detecting task switches and distribution shifts in non-stationary environments, enabling quick adaptation without storing past data.
Contribution
It proposes effective detection mechanisms for task changes and distribution shifts, and develops an online meta-learning algorithm that updates solely based on current data, achieving sublinear regret.
Findings
Significant performance improvements over baseline methods on three benchmarks.
Effective detection of task switches and distribution shifts.
Achieved sublinear task-averaged regret under mild conditions.
Abstract
Online meta-learning has recently emerged as a marriage between batch meta-learning and online learning, for achieving the capability of quick adaptation on new tasks in a lifelong manner. However, most existing approaches focus on the restrictive setting where the distribution of the online tasks remains fixed with known task boundaries. In this work, we relax these assumptions and propose a novel algorithm for task-agnostic online meta-learning in non-stationary environments. More specifically, we first propose two simple but effective detection mechanisms of task switches and distribution shift based on empirical observations, which serve as a key building block for more elegant online model updates in our algorithm: the task switch detection mechanism allows reusing of the best model available for the current task at hand, and the distribution shift detection mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
