Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu, Zhong Ji, YunLong Yu, Jiale Cao, Yanwei Pang, Jungong Han, Xuelong Li

TL;DR
This paper uses Neural Tangent Kernel theory to analyze and improve parameter-efficient fine-tuning methods for continual learning, addressing test-time forgetting and proposing a new framework that enhances performance and theoretical understanding.
Contribution
It introduces NTK-CL, a novel framework that reduces task-specific parameters and improves continual learning by leveraging NTK analysis for better feature representation and generalization.
Findings
NTK-CL triples feature representation, reducing generalization gaps.
Theoretical and empirical results show improved performance on benchmarks.
Adaptive mechanisms and feature orthogonality constraints enhance continual learning.
Abstract
Parameter-efficient fine-tuning for continual learning (PEFT-CL) has shown promise in adapting pre-trained models to sequential tasks while mitigating catastrophic forgetting problem. However, understanding the mechanisms that dictate continual performance in this paradigm remains elusive. To unravel this mystery, we undertake a rigorous analysis of PEFT-CL dynamics to derive relevant metrics for continual scenarios using Neural Tangent Kernel (NTK) theory. With the aid of NTK as a mathematical analysis tool, we recast the challenge of test-time forgetting into the quantifiable generalization gaps during training, identifying three key factors that influence these gaps and the performance of PEFT-CL: training sample size, task-level feature orthogonality, and regularization. To address these challenges, we introduce NTK-CL, a novel framework that eliminates task-specific parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsNeural Tangent Kernel
