Towards Redundancy-Free Sub-networks in Continual Learning
Cheng Chen, Jingkuan Song, LianLi Gao, Heng Tao Shen

TL;DR
This paper introduces IBM, a novel method for continual learning that creates redundancy-free sub-networks by leveraging information bottleneck principles, significantly reducing parameters and training time while mitigating catastrophic forgetting.
Contribution
IBM is the first approach to eliminate redundancy within sub-networks in continual learning, improving efficiency and effectiveness over existing parameter isolation methods.
Findings
IBM reduces sub-network parameters by 70%.
IBM decreases training time by 80%.
IBM outperforms state-of-the-art methods in continual learning.
Abstract
Catastrophic Forgetting (CF) is a prominent issue in continual learning. Parameter isolation addresses this challenge by masking a sub-network for each task to mitigate interference with old tasks. However, these sub-networks are constructed relying on weight magnitude, which does not necessarily correspond to the importance of weights, resulting in maintaining unimportant weights and constructing redundant sub-networks. To overcome this limitation, inspired by information bottleneck, which removes redundancy between adjacent network layers, we propose \textbf{\underline{I}nformation \underline{B}ottleneck \underline{M}asked sub-network (IBM)} to eliminate redundancy within sub-networks. Specifically, IBM accumulates valuable information into essential weights to construct redundancy-free sub-networks, not only effectively mitigating CF by freezing the sub-networks but also facilitating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
