Is Parameter Collision Hindering Continual Learning in LLMs?
Shuo Yang, Kun-Peng Ning, Yu-Yang Liu, Jia-Yu Yao and, Yong-Hong Tian, Yi-Bing Song, Li Yuan

TL;DR
This paper investigates how parameter collision affects continual learning in LLMs, revealing that reducing collisions enhances task orthogonality and knowledge retention, and proposes N-LoRA to improve CL performance.
Contribution
The paper identifies parameter collision as a key factor in CL challenges and introduces N-LoRA, a method that reduces collisions to improve task orthogonality and knowledge retention in LLMs.
Findings
N-LoRA outperforms SOTA methods in CL benchmarks.
N-LoRA achieves +2.9 performance improvement.
N-LoRA reduces parameter collision by 58.1 times.
Abstract
Large Language Models (LLMs) often suffer from catastrophic forgetting when learning multiple tasks sequentially, making continual learning (CL) essential for their dynamic deployment. Existing state-of-the-art (SOTA) methods, such as O-LoRA, typically focus on constructing orthogonality tasks to decouple parameter interdependence from various domains.In this paper, we reveal that building non-collision parameters is a more critical factor in addressing CL challenges. Our theoretical and experimental analyses demonstrate that non-collision parameters can provide better task orthogonality, which is a sufficient but unnecessary condition. Furthermore, knowledge from multiple domains will be preserved in non-collision parameter subspaces, making it more difficult to forget previously seen data. Leveraging this insight, we propose Non-collision Low-Rank Adaptation (N-LoRA), a simple yet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Advanced Data Processing Techniques · Fuzzy Logic and Control Systems
MethodsFocus
