FSC-Net: Fast-Slow Consolidation Networks for Continual Learning
Mohamed El Gorrim

TL;DR
FSC-Net introduces a dual-network architecture inspired by neuroscience to improve continual learning by separating rapid task adaptation from gradual knowledge consolidation, effectively reducing catastrophic forgetting.
Contribution
The paper proposes a simple dual-network model with a fast and slow network, demonstrating that consolidation effectiveness depends more on methodology than architecture, and that pure replay outperforms distillation.
Findings
FSC-Net achieves significant retention improvements on Split-MNIST and Split-CIFAR-10.
Pure replay without distillation yields better performance, challenging common assumptions.
Empirical evidence supports dual-timescale consolidation as key to mitigating forgetting.
Abstract
Continual learning remains challenging due to catastrophic forgetting, where neural networks lose previously acquired knowledge when learning new tasks. Inspired by memory consolidation in neuroscience, we propose FSC-Net (Fast-Slow Consolidation Networks), a dual-network architecture that separates rapid task learning from gradual knowledge consolidation. Our method employs a fast network (NN1) for immediate adaptation to new tasks and a slow network (NN2) that consolidates knowledge through distillation and replay. Within the family of MLP-based NN1 variants we evaluated, consolidation effectiveness is driven more by methodology than architectural embellishments -- a simple MLP outperforms more complex similarity-gated variants by 1.2pp. Through systematic hyperparameter analysis, we observed empirically that pure replay without distillation during consolidation achieves superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
