Continual Learning with Global Alignment
Xueying Bai, Jinghuan Shang, Yifan Sun, Niranjan Balasubramanian

TL;DR
This paper introduces a global alignment method for continual learning that promotes appropriate data representation correlations to reduce interference and catastrophic forgetting, achieving state-of-the-art results without experience replay.
Contribution
It proposes a novel global alignment approach that aligns data representations across tasks using pre-trained token representations, improving continual learning performance.
Findings
Achieves SOTA performance without experience replay.
Improves class-incremental learning through task-incremental training.
Reduces catastrophic forgetting by promoting data representation correlations.
Abstract
Continual learning aims to sequentially learn new tasks without forgetting previous tasks' knowledge (catastrophic forgetting). One factor that can cause forgetting is the interference between the gradients on losses from different tasks. When the gradients on the current task's loss are in opposing directions to those on previous tasks' losses, updating the model for the current task may cause performance degradation on previous tasks. In this paper, we first identify causes of the above interference, and hypothesize that correlations between data representations are a key factor of interference. We then propose a method for promoting appropriate correlations between arbitrary tasks' data representations (i.e., global alignment) in individual task learning. Specifically, we learn the data representation as a task-specific composition of pre-trained token representations shared across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
