Continual Learning Using a Kernel-Based Method Over Foundation Models

Saleh Momeni; Sahisnu Mazumder; Bing Liu

arXiv:2412.15571·cs.LG·December 23, 2024

Continual Learning Using a Kernel-Based Method Over Foundation Models

Saleh Momeni, Sahisnu Mazumder, Bing Liu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces KLDA, a kernel-based continual learning method that leverages foundation model features with RBF kernels to effectively prevent catastrophic forgetting and inter-task class separation issues, achieving near joint training accuracy.

Contribution

The paper proposes KLDA, a novel kernel-based method for class-incremental learning that enhances foundation model features with RBF kernels and RFF, outperforming existing methods without replay data.

Findings

01

KLDA outperforms baseline methods on text and image datasets.

02

KLDA achieves accuracy comparable to joint training without replay data.

03

The method effectively mitigates catastrophic forgetting and class separation issues.

Abstract

Continual learning (CL) learns a sequence of tasks incrementally. This paper studies the challenging CL setting of class-incremental learning (CIL). CIL has two key challenges: catastrophic forgetting (CF) and inter-task class separation (ICS). Despite numerous proposed methods, these issues remain persistent obstacles. This paper proposes a novel CIL method, called Kernel Linear Discriminant Analysis (KLDA), that can effectively avoid CF and ICS problems. It leverages only the powerful features learned in a foundation model (FM). However, directly using these features proves suboptimal. To address this, KLDA incorporates the Radial Basis Function (RBF) kernel and its Random Fourier Features (RFF) to enhance the feature representations from the FM, leading to improved performance. When a new task arrives, KLDA computes only the mean for each class in the task and updates a shared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salehmomeni/klda
pytorchOfficial

Videos

Continual Learning Using a Kernel-Based Method Over Foundation Models· underline

Taxonomy

TopicsNeural Networks and Applications · Speech Recognition and Synthesis · Speech and Audio Processing