Towards stable training of parallel continual learning

Li Yuepan; Fan Lyu; Yuyang Li; Wei Feng; Guangcan Liu; Fanhua Shang

arXiv:2407.08214·cs.LG·July 12, 2024

Towards stable training of parallel continual learning

Li Yuepan, Fan Lyu, Yuyang Li, Wei Feng, Guangcan Liu, Fanhua Shang

PDF

Open Access 1 Repo

TL;DR

This paper proposes Stable Parallel Continual Learning (SPCL), a method to improve training stability in multi-source continual learning by orthogonalizing network parameters and gradients, leading to more reliable learning across tasks.

Contribution

The paper introduces SPCL, a novel approach that enhances stability in parallel continual learning through orthogonality constraints and gradient management techniques.

Findings

01

SPCL outperforms existing methods in training stability.

02

Orthogonal constraints improve feature disentanglement.

03

Gradient orthogonalization reduces conflicts across tasks.

Abstract

Parallel Continual Learning (PCL) tasks investigate the training methods for continual learning with multi-source input, where data from different tasks are learned as they arrive. PCL offers high training efficiency and is well-suited for complex multi-source data systems, such as autonomous vehicles equipped with multiple sensors. However, at any time, multiple tasks need to be trained simultaneously, leading to severe training instability in PCL. This instability manifests during both forward and backward propagation, where features are entangled and gradients are conflict. This paper introduces Stable Parallel Continual Learning (SPCL), a novel approach that enhances the training stability of PCL for both forward and backward propagation. For the forward propagation, we apply Doubly-block Toeplit (DBT) Matrix based orthogonality constraints to network parameters to ensure stable and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jasmine-0/spcl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHigher Education Learning Practices