Group and Exclusive Sparse Regularization-based Continual Learning of CNNs
Basile Tousside, Janis Mohr, J\"org Frochte

TL;DR
This paper introduces GESCL, a regularization-based continual learning method for CNNs that prevents forgetting and efficiently sparsifies the network, outperforming existing approaches on vision benchmarks.
Contribution
The paper proposes GESCL, a novel regularization technique that stabilizes important filters and promotes sparsity, reducing parameters and computation in continual learning.
Findings
GESCL outperforms state-of-the-art methods in accuracy
GESCL effectively prevents catastrophic forgetting
GESCL reduces network parameters and computational cost
Abstract
We present a regularization-based approach for continual learning (CL) of fixed capacity convolutional neural networks (CNN) that does not suffer from the problem of catastrophic forgetting when learning multiple tasks sequentially. This method referred to as Group and Exclusive Sparsity based Continual Learning (GESCL) avoids forgetting of previous tasks by ensuring the stability of the CNN via a stability regularization term, which prevents filters detected as important for past tasks to deviate too much when learning a new task. On top of that, GESCL makes the network plastic via a plasticity regularization term that leverage the over-parameterization of CNNs to efficiently sparsify the network and tunes unimportant filters making them relevant for future tasks. Doing so, GESCL deals with significantly less parameters and computation compared to CL approaches that either dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis
