Restricted Orthogonal Gradient Projection for Continual Learning

Zeyuan Yang; Zonghan Yang; Peng Li; Yang Liu

arXiv:2301.12131·cs.LG·January 31, 2023

Restricted Orthogonal Gradient Projection for Continual Learning

Zeyuan Yang, Zonghan Yang, Peng Li, Yang Liu

PDF

Open Access

TL;DR

This paper introduces ROGO, a novel gradient projection method for continual learning that improves forward knowledge transfer using a fixed network, without additional data buffers or parameters.

Contribution

ROGO employs a restricted orthogonal constraint to enhance forward transfer in continual learning with a fixed architecture, providing theoretical guarantees.

Findings

01

ROGO outperforms several strong baselines in experiments.

02

The framework requires no data buffers or extra parameters.

03

Theoretical guarantees support the effectiveness of the relaxing strategy.

Abstract

Continual learning aims to avoid catastrophic forgetting and effectively leverage learned experiences to master new knowledge. Existing gradient projection approaches impose hard constraints on the optimization space for new tasks to minimize interference, which simultaneously hinders forward knowledge transfer. To address this issue, recent methods reuse frozen parameters with a growing network, resulting in high computational costs. Thus, it remains a challenge whether we can improve forward knowledge transfer for gradient projection approaches using a fixed network architecture. In this work, we propose the Restricted Orthogonal Gradient prOjection (ROGO) framework. The basic idea is to adopt a restricted orthogonal constraint allowing parameters optimized in the direction oblique to the whole frozen space to facilitate forward knowledge transfer while consolidating previous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Indoor and Outdoor Localization Technologies