DLCFT: Deep Linear Continual Fine-Tuning for General Incremental   Learning

Hyounguk Shon; Janghyeon Lee; Seung Hwan Kim; Junmo Kim

arXiv:2208.08112·cs.LG·August 18, 2022

DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning

Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim

PDF

Open Access

TL;DR

This paper introduces DLCFT, a continual learning method that fine-tunes pre-trained models using linearization and quadratic regularization, effectively preventing forgetting across various incremental learning scenarios.

Contribution

It proposes a novel linearization-based continual fine-tuning framework with quadratic regularization, improving performance and theoretical understanding of regularization methods in incremental learning.

Findings

01

Pre-trained models can be effectively fine-tuned continually with linearization techniques.

02

Quadratic regularization acts as an optimal policy for continual learning.

03

The method outperforms existing approaches in class-incremental tasks.

Abstract

Pre-trained representation is one of the key elements in the success of modern deep learning. However, existing works on continual learning methods have mostly focused on learning models incrementally from scratch. In this paper, we explore an alternative framework to incremental learning where we continually fine-tune the model from a pre-trained representation. Our method takes advantage of linearization technique of a pre-trained neural network for simple and effective continual learning. We show that this allows us to design a linear model where quadratic parameter regularization method is placed as the optimal continual learning policy, and at the same time enjoying the high performance of neural networks. We also show that the proposed algorithm enables parameter regularization methods to be applied to class-incremental problems. Additionally, we provide a theoretical reason why…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsElastic Weight Consolidation