Sparse Orthogonal Parameters Tuning for Continual Learning
Kun-Peng Ning, Hai-Jian Ke, Yu-Yang Liu, Jia-Yu Yao, Yong-Hong Tian, Li Yuan

TL;DR
This paper introduces SoTU, a novel method utilizing sparse orthogonal parameters to improve continual learning by effectively merging knowledge from multiple tasks and reducing catastrophic forgetting.
Contribution
The paper proposes SoTU, a new approach that leverages sparse orthogonal parameters for continual learning, demonstrating its effectiveness across various benchmarks.
Findings
SoTU achieves superior performance on multiple CL benchmarks.
It provides optimal feature representations without complex classifiers.
The method effectively merges knowledge from multiple domains.
Abstract
Continual learning methods based on pre-trained models (PTM) have recently gained attention which adapt to successive downstream tasks without catastrophic forgetting. These methods typically refrain from updating the pre-trained parameters and instead employ additional adapters, prompts, and classifiers. In this paper, we from a novel perspective investigate the benefit of sparse orthogonal parameters for continual learning. We found that merging sparse orthogonality of models learned from multiple streaming tasks has great potential in addressing catastrophic forgetting. Leveraging this insight, we propose a novel yet effective method called SoTU (Sparse Orthogonal Parameters TUning). We hypothesize that the effectiveness of SoTU lies in the transformation of knowledge learned from multiple domains into the fusion of orthogonal delta parameters. Experimental evaluations on diverse CL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Speech and Audio Processing · Advanced SAR Imaging Techniques
MethodsSoftmax · Attention Is All You Need
