Scalable and Order-robust Continual Learning with Additive Parameter Decomposition
Jaehong Yoon, Saehoon Kim, Eunho Yang, Sung Ju Hwang

TL;DR
This paper introduces Additive Parameter Decomposition (APD), a scalable and order-robust continual learning method that mitigates catastrophic forgetting and order sensitivity by representing each task's parameters as the sum of task-shared and sparse task-adaptive components.
Contribution
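The paper proposes APD, which decomposes each task's parameters into task-shared and sparse task-adaptive parts so that continual learning scales to a large number of tasks, stays robust to task order, and remains efficient to train. It is reported to outperform existing methods.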
Findings
APD outperforms state-of-the-art methods in accuracy.
APD demonstrates robustness to task order variations.
APD is computationally efficient and scalable.
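Order sensitivity, as the abstract describes it, means that a task's performance varies with the order in which tasks arrive. The sketch below shows one simple way to quantify that: train under several task orders and take, for each task, the gap between its best and worst accuracy. This is an illustrative measure, not necessarily the metric used in the paper; the function name `per_task_spread` is hypothetical and the accuracy values are made-up placeholders, not results from the paper.

```python
def per_task_spread(acc_by_order):
    """For each task, return max accuracy minus min accuracy across task orders.

    acc_by_order: list of dicts, one per task ordering, mapping task name
    to the final accuracy on that task under that ordering.
    """
    tasks = acc_by_order[0].keys()
    return {
        task: max(run[task] for run in acc_by_order)
        - min(run[task] for run in acc_by_order)
        for task in tasks
    }


# Placeholder accuracies for two tasks under two arrival orders (made up).
runs = [
    {"task_a": 0.90, "task_b": 0.80},  # order: a then b
    {"task_a": 0.70, "task_b": 0.82},  # order: b then a
]

spread = per_task_spread(runs)
worst_task = max(spread, key=spread.get)  # task most affected by ordering
```

A method that is robust to task order would keep every per-task gap small, so the largest gap is a natural single-number summary.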
Abstract
While recent continual learning methods largely alleviate the catastrophic forgetting problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively handle catastrophic forgetting and be efficient to train even with a large number of tasks. Second, it needs to tackle the problem of order-sensitivity, where the performance of the tasks varies considerably with the order in which the tasks arrive, which may cause serious problems where fairness plays a critical role (e.g., medical diagnosis). To tackle these practical challenges, we propose a novel continual learning method that is scalable as well as order-robust, which, instead of learning a completely shared set of weights, represents the parameters for each task as a sum of task-shared and sparse task-adaptive parameters. With our Additive…
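The decomposition described in the abstract, task parameters as a sum of task-shared and sparse task-adaptive parameters, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: parameters are flat Python lists, the random values stand in for learned weights, and soft-thresholding is used only as one common way to obtain a sparse vector. The names `soft_threshold`, `compose_task_params`, and `lam` are hypothetical.

```python
import random


def soft_threshold(values, lam):
    """Shrink each value toward zero by lam; values in [-lam, lam] become 0.0.

    Assumption: used here only to produce a sparse task-adaptive vector.
    """
    out = []
    for v in values:
        if v > lam:
            out.append(v - lam)
        elif v < -lam:
            out.append(v + lam)
        else:
            out.append(0.0)
    return out


def compose_task_params(shared, task_adaptive):
    """Task parameters = task-shared parameters + task-adaptive parameters."""
    return [s + a for s, a in zip(shared, task_adaptive)]


random.seed(0)
NUM_WEIGHTS = 8
NUM_TASKS = 3

# One shared vector for all tasks (random stand-in for learned weights).
shared = [random.gauss(0.0, 1.0) for _ in range(NUM_WEIGHTS)]

# One sparse task-adaptive vector per task.
adaptive = {
    t: soft_threshold([random.gauss(0.0, 0.1) for _ in range(NUM_WEIGHTS)], lam=0.1)
    for t in range(NUM_TASKS)
}

# Effective parameters used when running each task.
task_params = {t: compose_task_params(shared, adaptive[t]) for t in range(NUM_TASKS)}

# Only the nonzero task-adaptive entries need to be stored per task.
stored_nonzeros = {
    t: sum(1 for a in adaptive[t] if a != 0.0) for t in range(NUM_TASKS)
}
```

Because the task-adaptive vectors are sparse, the per-task storage in this sketch is the count of nonzero entries rather than a full copy of the weights, which is the intuition behind the scalability claim.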
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
