TL;DR
This paper introduces SR$^2$-LoRA, a method that constrains inter-layer relation drift in pre-trained models to mitigate catastrophic forgetting in class-incremental learning, showing improved performance on benchmarks.
Contribution
It proposes a novel relation alignment approach for PEFT models that enhances robustness against forgetting during incremental learning.
Findings
SR$^2$-LoRA effectively reduces catastrophic forgetting in CIL tasks.
The method's advantages increase with the number of tasks.
Theoretical analysis confirms robustness of relation singular value alignment.
Abstract
Pre-trained models with parameter-efficient fine-tuning (PEFT) have demonstrated promising potential for class-incremental learning (CIL), yet catastrophic forgetting still persists when adapting models to new tasks. In this paper, we present a novel perspective on catastrophic forgetting through the analysis of inter-layer relation drift, i.e., the progressive disruption of relationships among layer-wise representations during the learning of new tasks. We theoretically show that the increase of such drift reduces the classification margins of previously learned tasks, thereby degrading overall model performance. To address this issue, we propose \underline{S}elf-\underline{R}ectifying inter-layer \underline{R}elation Low-Rank Adaptation~(SR-LoRA), a simple yet effective method that mitigates catastrophic forgetting by constraining inter-layer relation drift. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
