Overcoming Growth-Induced Forgetting in Task-Agnostic Continual Learning
Yuqing Zhao, Jiannong Cao, Divya Saxena, Xiaoyun Liu, Changlin Song, Bo Yuan, Julie McCann

TL;DR
This paper introduces SparseGrow, a method that uses gradient and parameter sparsity through layer expansion and gating to mitigate growth-induced forgetting in task-agnostic continual learning, enhancing adaptability and knowledge retention.
Contribution
SparseGrow is a novel approach that controls model growth via sparsity techniques, effectively balancing adaptability and forgetting in task-agnostic continual learning.
Findings
SparseGrow reduces forgetting significantly across datasets.
Controlled layer expansion improves adaptability.
Sparsity techniques enhance knowledge retention.
Abstract
In continual learning (CL), model growth enhances adaptability to new data. However, when model growth is applied improperly, especially in task-agnostic CL, where the entire grown model is used for inference, it can lead to severe degradation of learned knowledge, a problem we term growth-induced forgetting. Most existing methods that adopt model growth to improve adaptability often overlook the forgetting issue, resulting in compromised knowledge retention, making them unsuitable for task-agnostic settings. To promote both adaptability and knowledge retention with model growth, we identify the key: gradient and parameter sparsity. Introducing SparseGrow, which increases gradient sparsity through layer expansion and gradient gating to enable focused updates on parameters while preserving critical parameters, thus inhibiting forgetting. Moreover, it promotes parameter sparsity with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMemory Processes and Influences · Psychological and Educational Research Studies · Higher Education Learning Practices
