Towards stable training of parallel continual learning
Li Yuepan, Fan Lyu, Yuyang Li, Wei Feng, Guangcan Liu, Fanhua Shang

TL;DR
This paper proposes Stable Parallel Continual Learning (SPCL), a method to improve training stability in multi-source continual learning by orthogonalizing network parameters and gradients, leading to more reliable learning across tasks.
Contribution
The paper introduces SPCL, a novel approach that enhances stability in parallel continual learning through orthogonality constraints and gradient management techniques.
Findings
SPCL outperforms existing methods in training stability.
Orthogonal constraints improve feature disentanglement.
Gradient orthogonalization reduces conflicts across tasks.
Abstract
Parallel Continual Learning (PCL) tasks investigate the training methods for continual learning with multi-source input, where data from different tasks are learned as they arrive. PCL offers high training efficiency and is well-suited for complex multi-source data systems, such as autonomous vehicles equipped with multiple sensors. However, at any time, multiple tasks need to be trained simultaneously, leading to severe training instability in PCL. This instability manifests during both forward and backward propagation, where features are entangled and gradients are conflict. This paper introduces Stable Parallel Continual Learning (SPCL), a novel approach that enhances the training stability of PCL for both forward and backward propagation. For the forward propagation, we apply Doubly-block Toeplit (DBT) Matrix based orthogonality constraints to network parameters to ensure stable and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigher Education Learning Practices
