FFNB: Forgetting-Free Neural Blocks for Deep Continual Visual Learning
Hichem Sahbi, Haoming Zhan

TL;DR
This paper introduces FFNB, a novel dynamic neural network architecture designed for continual learning that prevents forgetting by constraining parameters in the null-space of previous tasks, achieving effective incremental learning.
Contribution
The paper proposes a new forgetting-free neural block (FFNB) with a training procedure that constrains parameters in the null-space, combining Fisher discriminant analysis and Bayesian optimality for continual learning.
Findings
High effectiveness demonstrated on challenging classification tasks.
Outperforms existing methods in mitigating catastrophic forgetting.
Enables incremental end-to-end fine-tuning for improved performance.
Abstract
Deep neural networks (DNNs) have recently achieved a great success in computer vision and several related fields. Despite such progress, current neural architectures still suffer from catastrophic interference (a.k.a. forgetting) which obstructs DNNs to learn continually. While several state-of-the-art methods have been proposed to mitigate forgetting, these existing solutions are either highly rigid (as regularization) or time/memory demanding (as replay). An intermediate class of methods, based on dynamic networks, has been proposed in the literature and provides a reasonable balance between task memorization and computational footprint. In this paper, we devise a dynamic network architecture for continual learning based on a novel forgetting-free neural block (FFNB). Training FFNB features on new tasks is achieved using a novel procedure that constrains the underlying parameters in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
