TRGP: Trust Region Gradient Projection for Continual Learning

Sen Lin; Li Yang; Deliang Fan; Junshan Zhang

arXiv:2202.02931·cs.LG·February 8, 2022·22 cites

TRGP: Trust Region Gradient Projection for Continual Learning

Sen Lin, Li Yang, Deliang Fan, Junshan Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

TRGP introduces a trust region-based gradient projection method for continual learning that selectively reuses old task knowledge, improving transfer and reducing forgetting.

Contribution

It proposes a novel trust region concept for task selection and a scaled weight projection for efficient knowledge reuse in continual learning.

Findings

01

Significantly outperforms state-of-the-art methods

02

Effectively balances knowledge transfer and forgetting

03

Demonstrates robustness across multiple tasks

Abstract

Catastrophic forgetting is one of the major challenges in continual learning. To address this issue, some existing methods put restrictive constraints on the optimization space of the new task for minimizing the interference to old tasks. However, this may lead to unsatisfactory performance for the new task, especially when the new task is strongly correlated with old tasks. To tackle this challenge, we propose Trust Region Gradient Projection (TRGP) for continual learning to facilitate the forward knowledge transfer based on an efficient characterization of task correlation. Particularly, we introduce a notion of `trust region' to select the most related old tasks for the new task in a layer-wise and single-shot manner, using the norm of gradient projection onto the subspace spanned by task inputs. Then, a scaled weight projection is proposed to cleverly reuse the frozen weights of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LYang-666/TRGP
pytorchOfficial

Videos

TRGP: Trust Region Gradient Projection for Continual Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications