DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning
Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim

TL;DR
This paper introduces DLCFT, a continual learning method that fine-tunes pre-trained models using linearization and quadratic regularization, effectively preventing forgetting across various incremental learning scenarios.
Contribution
It proposes a novel linearization-based continual fine-tuning framework with quadratic regularization, improving performance and theoretical understanding of regularization methods in incremental learning.
Findings
Pre-trained models can be effectively fine-tuned continually with linearization techniques.
Quadratic regularization acts as an optimal policy for continual learning.
The method outperforms existing approaches in class-incremental tasks.
Abstract
Pre-trained representation is one of the key elements in the success of modern deep learning. However, existing works on continual learning methods have mostly focused on learning models incrementally from scratch. In this paper, we explore an alternative framework to incremental learning where we continually fine-tune the model from a pre-trained representation. Our method takes advantage of linearization technique of a pre-trained neural network for simple and effective continual learning. We show that this allows us to design a linear model where quadratic parameter regularization method is placed as the optimal continual learning policy, and at the same time enjoying the high performance of neural networks. We also show that the proposed algorithm enables parameter regularization methods to be applied to class-incremental problems. Additionally, we provide a theoretical reason why…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsElastic Weight Consolidation
