Revisiting Neural Networks for Continual Learning: An Architectural Perspective
Aojun Lu, Tao Feng, Hangjie Yuan, Xiaotian Song, Yanan Sun

TL;DR
This paper investigates how different neural network architectures influence continual learning performance, providing insights and a method to craft more effective, compact CL architectures like AlexAC and ResAC, achieving state-of-the-art results.
Contribution
It systematically analyzes the impact of architectural choices on CL and introduces ArchCraft, a search space and method for designing CL-friendly, compact neural networks.
Findings
Architectural design significantly affects CL performance.
ArchCraft produces architectures that are more parameter-efficient.
Achieves state-of-the-art CL results with highly compact models.
Abstract
Efforts to overcome catastrophic forgetting have primarily centered around developing more effective Continual Learning (CL) methods. In contrast, less attention was devoted to analyzing the role of network architecture design (e.g., network depth, width, and components) in contributing to CL. This paper seeks to bridge this gap between network architecture design and CL, and to present a holistic study on the impact of network architectures on CL. This work considers architecture design at the network scaling level, i.e., width and depth, and also at the network components, i.e., skip connections, global pooling layers, and down-sampling. In both cases, we first derive insights through systematically exploring how architectural designs affect CL. Then, grounded in these insights, we craft a specialized search space for CL and further propose a simple yet effective ArchCraft method to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Cognitive Science and Education Research
