Distribution-Level Memory Recall for Continual Learning: Preserving   Knowledge and Avoiding Confusion

Shaoxu Cheng; Kanglei Geng; Chiyuan He; Zihuan Qiu; Linfeng Xu; Heqian; Qiu; Lanxiao Wang; Qingbo Wu; Fanman Meng; Hongliang Li

arXiv:2408.02695·cs.LG·August 7, 2024

Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion

Shaoxu Cheng, Kanglei Geng, Chiyuan He, Zihuan Qiu, Linfeng Xu, Heqian, Qiu, Lanxiao Wang, Qingbo Wu, Fanman Meng, Hongliang Li

PDF

Open Access

TL;DR

This paper introduces the Distribution-Level Memory Recall (DMR) method for continual learning, which models old knowledge distributions with Gaussian mixtures to generate more accurate pseudo features, reducing confusion and improving knowledge retention.

Contribution

The paper proposes a novel distribution-level pseudo feature generation approach using Gaussian mixtures, and introduces methods to handle multimodal imbalance and quantify confusion in continual learning.

Findings

01

DMR outperforms prototype-based methods in preserving old knowledge.

02

IGIM effectively mitigates multimodal imbalance and confusion.

03

IMFE enhances pseudo features, reducing classification errors.

Abstract

Continual Learning (CL) aims to enable Deep Neural Networks (DNNs) to learn new data without forgetting previously learned knowledge. The key to achieving this goal is to avoid confusion at the feature level, i.e., avoiding confusion within old tasks and between new and old tasks. Previous prototype-based CL methods generate pseudo features for old knowledge replay by adding Gaussian noise to the centroids of old classes. However, the distribution in the feature space exhibits anisotropy during the incremental process, which prevents the pseudo features from faithfully reproducing the distribution of old knowledge in the feature space, leading to confusion in classification boundaries within old tasks. To address this issue, we propose the Distribution-Level Memory Recall (DMR) method, which uses a Gaussian mixture model to precisely fit the feature distribution of old knowledge at the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsMixup