TL;DR
SpaceNet introduces an adaptive sparse neural network approach for class incremental learning, effectively reducing catastrophic forgetting and efficiently utilizing fixed model capacity without expansion.
Contribution
It proposes a novel architectural-based method that trains sparse networks from scratch, compresses task-specific connections, and reduces interference, outperforming existing methods in continual learning.
Findings
Outperforms regularization-based methods on benchmarks
Maintains fixed capacity while learning multiple tasks
Reduces memory usage compared to rehearsal methods
Abstract
The continual learning (CL) paradigm aims to enable neural networks to learn tasks continually in a sequential fashion. The fundamental challenge in this learning paradigm is catastrophic forgetting previously learned tasks when the model is optimized for a new task, especially when their data is not accessible. Current architectural-based methods aim at alleviating the catastrophic forgetting problem but at the expense of expanding the capacity of the model. Regularization-based methods maintain a fixed model capacity; however, previous studies showed the huge performance degradation of these methods when the task identity is not available during inference (e.g. class incremental learning scenario). In this work, we propose a novel architectural-based method referred as SpaceNet for class incremental learning scenario where we utilize the available fixed capacity of the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
