Self-Attention Meta-Learner for Continual Learning
Ghada Sokar, Decebal Constantin Mocanu, Mykola Pechenizkiy

TL;DR
This paper introduces SAM, a self-attention meta-learning approach that enhances continual learning by selecting relevant representations to prevent catastrophic forgetting and improve task performance.
Contribution
SAM is a novel method that learns a prior knowledge with attention mechanisms, enabling task-specific representations and reducing interference in continual learning.
Findings
Outperforms state-of-the-art methods on Split CIFAR-10/100 and Split MNIST.
Meta-attention boosts informative features and correct target identification.
Existing methods improve when initialized with SAM.
Abstract
Continual learning aims to provide intelligent agents capable of learning multiple tasks sequentially with neural networks. One of its main challenging, catastrophic forgetting, is caused by the neural networks non-optimal ability to learn in non-stationary distributions. In most settings of the current approaches, the agent starts from randomly initialized parameters and is optimized to master the current task regardless of the usefulness of the learned representation for future tasks. Moreover, each of the future tasks uses all the previously learned knowledge although parts of this knowledge might not be helpful for its learning. These cause interference among tasks, especially when the data of previous tasks is not accessible. In this paper, we propose a new method, named Self-Attention Meta-Learner (SAM), which learns a prior knowledge for continual learning that permits learning a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
