Overcoming Catastrophic Forgetting via Direction-Constrained Optimization
Yunfei Teng, Anna Choromanska, Murray Campbell, Songtao Lu, Parikshit, Ram, Lior Horesh

TL;DR
This paper introduces Direction-Constrained Optimization (DCO), a novel method for continual learning that constrains model parameters within task-specific cones to prevent catastrophic forgetting, showing improved performance over existing methods.
Contribution
The paper proposes DCO, a new regularization-based continual learning algorithm using autoencoders to approximate and constrain top principal directions, reducing forgetting.
Findings
DCO outperforms state-of-the-art regularization methods.
Autoencoders effectively identify task-specific principal directions.
Memory-efficient DCO-COMP maintains performance with fixed memory size.
Abstract
This paper studies a new design of the optimization algorithm for training deep learning models with a fixed architecture of the classification network in a continual learning framework. The training data is non-stationary and the non-stationarity is imposed by a sequence of distinct tasks. We first analyze a deep model trained on only one learning task in isolation and identify a region in network parameter space, where the model performance is close to the recovered optimum. We provide empirical evidence that this region resembles a cone that expands along the convergence direction. We study the principal directions of the trajectory of the optimizer after convergence and show that traveling along a few top principal directions can quickly bring the parameters outside the cone but this is not the case for the remaining directions. We argue that catastrophic forgetting in a continual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsSolana Customer Service Number +1-833-534-1729
