Defeating Catastrophic Forgetting via Enhanced Orthogonal Weights Modification
Yanni Li, Bing Liu, Kaicheng Yao, Xiaoli Kou, Pengfan Lv, and Yueshen Xu, Jiangtao Cui

TL;DR
This paper introduces EOWM, an enhanced orthogonal weights modification method, to effectively address catastrophic forgetting in neural networks by analyzing weight gradients, proposing a new strategy, and establishing an upper bound on learnable tasks, with superior experimental results.
Contribution
The paper proposes EOWM, a novel continual learning method that enhances OWM by analyzing weight gradients and establishing an upper bound on learnable tasks.
Findings
EOWM outperforms state-of-the-art CL methods on benchmarks.
The weight gradient depends on input space and previous weights.
An upper bound for sequential task learning is theoretically established.
Abstract
The ability of neural networks (NNs) to learn and remember multiple tasks sequentially is facing tough challenges in achieving general artificial intelligence due to their catastrophic forgetting (CF) issues. Fortunately, the latest OWM Orthogonal Weights Modification) and other several continual learning (CL) methods suggest some promising ways to overcome the CF issue. However, none of existing CL methods explores the following three crucial questions for effectively overcoming the CF issue: that is, what knowledge does it contribute to the effective weights modification of the NN during its sequential tasks learning? When the data distribution of a new learning task changes corresponding to the previous learned tasks, should a uniform/specific weight modification strategy be adopted or not? what is the upper bound of the learningable tasks sequentially for a given CL method? ect. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
