Parameter-Level Soft-Masking for Continual Learning

Tatsuya Konishi; Mori Kurokawa; Chihiro Ono; Zixuan Ke; Gyuhak Kim; Bing Liu

arXiv:2306.14775 · cs.LG · June 27, 2023 · 5 cites

TL;DR

This paper introduces SPG, a novel parameter-level soft-masking technique for continual learning that prevents catastrophic forgetting, enhances knowledge transfer, and reduces network capacity usage, outperforming existing methods.

Contribution

It is the first work to apply parameter-level soft-masking in continual learning, enabling full network use per task while mitigating forgetting and promoting transfer.

Findings

01. SPG effectively prevents catastrophic forgetting.

02. SPG enhances knowledge transfer among similar and dissimilar tasks.

03. SPG reduces network capacity consumption.

Abstract

Existing research on task incremental learning in continual learning has primarily focused on preventing catastrophic forgetting (CF). Although several techniques have achieved learning with no CF, they attain it by letting each task monopolize a sub-network in a shared network, which seriously limits knowledge transfer (KT) and causes over-consumption of the network capacity, i.e., as more tasks are learned, the performance deteriorates. The goal of this paper is threefold: (1) overcoming CF, (2) encouraging KT, and (3) tackling the capacity problem. A novel technique (called SPG) is proposed that soft-masks (partially blocks) parameter updating in training based on the importance of each parameter to old tasks. Each task still uses the full network, i.e., no monopoly of any part of the network by any task, which enables maximum KT and reduction in capacity usage. To our knowledge,…
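The soft-masking idea in the abstract (partially blocking each parameter's update according to its importance to old tasks) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper or its repository: importance scores are taken to lie in [0, 1], scores from earlier tasks are combined by elementwise maximum, and each gradient is scaled by (1 - importance). The function names `accumulate_importance` and `soft_masked_step` are hypothetical.

```python
def accumulate_importance(per_task_importance):
    """Combine per-task importance scores (assumed to lie in [0, 1])
    by taking the elementwise maximum over all old tasks."""
    return [max(scores) for scores in zip(*per_task_importance)]


def soft_masked_step(params, grads, importance, lr=0.1):
    """One plain SGD step in which each gradient is scaled by
    (1 - importance): importance 1 blocks the update entirely,
    importance 0 leaves it untouched, values in between block it partially."""
    return [p - lr * (1.0 - a) * g for p, g, a in zip(params, grads, importance)]


# Hypothetical importance scores for three parameters after two old tasks.
old_task_importance = [
    [1.0, 0.5, 0.0],   # task 1
    [0.0, 0.75, 0.0],  # task 2
]
importance = accumulate_importance(old_task_importance)

# Training a new task: every parameter receives the same raw gradient,
# but the soft mask lets through different amounts of it.
params = [1.0, 1.0, 1.0]
grads = [1.0, 1.0, 1.0]
new_params = soft_masked_step(params, grads, importance, lr=0.5)
print(importance)
print(new_params)
```

In this sketch no parameter is reserved for a single task: the new task's forward pass still uses all three parameters, and only the size of each update is restricted. That is the contrast the abstract draws with methods that let each task monopolize a sub-network.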

Code & Models

Repositories

uic-liu-lab/spg (PyTorch, official)

Videos

Parameter-Level Soft-Masking for Continual Learning · SlidesLive

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications