Scalable and Order-robust Continual Learning with Additive Parameter Decomposition

Jaehong Yoon; Saehoon Kim; Eunho Yang; Sung Ju Hwang

arXiv:1902.09432·cs.LG·February 18, 2020·19 cites

TL;DR

This paper introduces Additive Parameter Decomposition (APD), a scalable and order-robust continual learning method that mitigates catastrophic forgetting and order sensitivity by representing each task's parameters as the sum of task-shared and sparse task-adaptive components.

Contribution

The paper proposes APD, which decomposes each task's parameters into task-shared and sparse task-adaptive parts to improve scalability, order-robustness, and training efficiency in continual learning. The authors report that it outperforms existing methods.

Findings

01 APD outperforms state-of-the-art methods in accuracy.

02 APD demonstrates robustness to task order variations.

03 APD is computationally efficient and scalable.

Abstract

While recent continual learning methods largely alleviate the catastrophic forgetting problem on toy-sized datasets, some issues remain to be tackled before they can be applied to real-world problem domains. First, a continual learning model should effectively handle catastrophic forgetting and be efficient to train even with a large number of tasks. Second, it needs to tackle the problem of order-sensitivity, where the performance on each task varies widely with the order in which tasks arrive; this can cause serious problems where fairness plays a critical role (e.g. medical diagnosis). To tackle these practical challenges, we propose a novel continual learning method that is both scalable and order-robust: instead of learning a completely shared set of weights, it represents the parameters for each task as a sum of task-shared and sparse task-adaptive parameters. With our Additive…
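The decomposition described in the abstract, per-task parameters formed as a sum of task-shared and sparse task-adaptive parameters, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the flat weight lists, the example values, and the use of soft-thresholding (a standard way to obtain exact zeros under an L1 penalty) with a threshold of 0.1 are all assumptions made for illustration.

```python
def soft_threshold(values, lam):
    """Shrink each value toward zero by lam; entries within lam of zero become zero.

    Assumption: sparsity of the task-adaptive part is obtained via an
    L1-style shrinkage step. The abstract only states that this part is sparse.
    """
    return [max(abs(v) - lam, 0.0) * (1 if v > 0 else -1) for v in values]


def compose_task_params(shared, task_adaptive):
    """Effective weights for one task: task-shared plus task-adaptive."""
    return [s + a for s, a in zip(shared, task_adaptive)]


# Hypothetical flat weight vector shared by every task.
shared = [0.5, -1.0, 2.0, 0.0]

# Hypothetical dense per-task residuals before sparsification.
raw_residuals = {
    "task_a": [0.05, -0.3, 0.0, 0.2],
    "task_b": [-0.02, 0.01, 0.4, -0.08],
}

# Sparse task-adaptive parameters (threshold 0.1 is an illustrative choice).
adaptive = {name: soft_threshold(r, 0.1) for name, r in raw_residuals.items()}

# Each task's parameters are the sum of the shared and adaptive parts.
task_weights = {name: compose_task_params(shared, a) for name, a in adaptive.items()}

# Number of nonzero task-specific entries that would need to be stored per task.
nonzero_counts = {name: sum(1 for v in a if v != 0.0) for name, a in adaptive.items()}
```

Under this sketch, each additional task only adds the nonzero entries of its adaptive vector on top of the single shared vector, which is the intuition behind the scalability claim.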

Code & Models

Repositories

iclr2020-apd/anonymous_iclr2020_apd_code
tf · Official

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications