Efficient Meta-Learning for Continual Learning with Taylor Expansion   Approximation

Xiaohan Zou; Tong Lin

arXiv:2210.00713·cs.LG·October 4, 2022

Efficient Meta-Learning for Continual Learning with Taylor Expansion Approximation

Xiaohan Zou, Tong Lin

PDF

Open Access

TL;DR

This paper introduces an efficient meta-learning algorithm for continual learning that uses Taylor expansion to adapt regularization and learning rates, effectively reducing catastrophic forgetting while maintaining high computational efficiency.

Contribution

It proposes a novel meta-learning method leveraging Taylor expansion for importance estimation, avoiding second-order derivatives, and employing Proximal Gradient Descent for improved efficiency.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Achieves comparable or better accuracy with higher computational efficiency.

03

Effectively mitigates catastrophic forgetting in continual learning scenarios.

Abstract

Continual learning aims to alleviate catastrophic forgetting when handling consecutive tasks under non-stationary distributions. Gradient-based meta-learning algorithms have shown the capability to implicitly solve the transfer-interference trade-off problem between different examples. However, they still suffer from the catastrophic forgetting problem in the setting of continual learning, since the past data of previous tasks are no longer available. In this work, we propose a novel efficient meta-learning algorithm for solving the online continual learning problem, where the regularization terms and learning rates are adapted to the Taylor approximation of the parameter's importance to mitigate forgetting. The proposed method expresses the gradient of the meta-loss in closed-form and thus avoid computing second-order derivative which is computationally inhibitable. We also use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications