TAG: Task-based Accumulated Gradients for Lifelong learning
Pranshu Malviya, Balaraman Ravindran, Sarath Chandar

TL;DR
This paper introduces TAG, a task-aware optimizer that uses accumulated task-specific gradients to improve lifelong learning by reducing forgetting and enhancing transfer, outperforming existing methods on complex datasets.
Contribution
The paper proposes a novel task-aware optimizer that adapts learning rates based on task relatedness using accumulated gradients, advancing lifelong learning techniques.
Findings
Outperforms state-of-the-art lifelong learning methods.
Effectively reduces catastrophic forgetting.
Enables positive backward transfer.
Abstract
When an agent encounters a continual stream of new tasks in the lifelong learning setting, it leverages the knowledge it gained from the earlier tasks to help learn the new tasks better. In such a scenario, identifying an efficient knowledge representation becomes a challenging problem. Most research works propose to either store a subset of examples from the past tasks in a replay buffer, dedicate a separate set of parameters to each task or penalize excessive updates over parameters by introducing a regularization term. While existing methods employ the general task-agnostic stochastic gradient descent update rule, we propose a task-aware optimizer that adapts the learning rate based on the relatedness among tasks. We utilize the directions taken by the parameters during the updates by accumulating the gradients specific to each task. These task-based accumulated gradients act as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
