A Unified and General Framework for Continual Learning

Zhenyi Wang; Yan Li; Li Shen; Heng Huang

arXiv:2403.13249·cs.LG·March 21, 2024·3 cites

A Unified and General Framework for Continual Learning

Zhenyi Wang, Yan Li, Li Shen, Heng Huang

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a comprehensive framework for Continual Learning that unifies existing methods, reveals their shared mathematical structures, and proposes a novel refresh learning technique inspired by neuroscience to improve knowledge retention.

Contribution

It presents a unified optimization framework for various CL methods and introduces refresh learning, a new concept that enhances continual learning by unlearning and relearning information.

Findings

01

Shared mathematical structures among diverse CL methods

02

Refresh learning improves CL performance

03

Framework effectively unifies existing approaches

Abstract

Continual Learning (CL) focuses on learning from dynamic and changing data distributions while retaining previously acquired knowledge. Various methods have been developed to address the challenge of catastrophic forgetting, including regularization-based, Bayesian-based, and memory-replay-based techniques. However, these methods lack a unified framework and common terminology for describing their approaches. This research aims to bridge this gap by introducing a comprehensive and overarching framework that encompasses and reconciles these existing methodologies. Notably, this new framework is capable of encompassing established CL approaches as special instances within a unified and general optimization objective. An intriguing finding is that despite their diverse origins, these methods share common mathematical structures. This observation highlights the compatibility of these…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

- The unified framework provided by the authors is interesting and sheds light on particular characteristics of existing algorithms. - The main idea behind the proposed "refresh learning" algorithm seems to be reasonable. - The authors integrated their "refresh learning" approach to existing algorithms showing empirical results on three different datasets.

Weaknesses

- The first part of the paper (unified framework) seems to be unrelated to the second one (Refresh learning algorithm). - The refresh learning objective eqs (12), (13) is insufficiently motivated. - It's not clear why unlearning should be enforced on the current batch. The authors do not provide any motivation. - The conversion of the constraint to a PDE is not obvious to me and is not sufficiently explained in the paper. - The refresh learning algorithm ends up in a preconditioned ascent foll

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

The paper presents a unified framework for different continual learning methods, and derived a new CL approach from the unified objective. The paper presents detailed theoretical derivations of the algorithm and comprehensive experimental results to demonstrate the advantage of the new method of refresh learning.

Weaknesses

For me the more interesting and novel part of the paper is the refresh learning method, its derivation, intuition behind it, and its performance, while the part where the unified approach corresponds to different CL methods in different setup is more expected and easier to follow. I would recommend the authors shorten the part of how the unified objective corresponds to different special cases and further elaborate on refresh learning.

Reviewer 03Rating 3· reject, not good enoughConfidence 4

Strengths

Omitted.

Weaknesses

In my opinion, the paper has the following weaknesses: - The proposed framework is not very interesting. Basically, it says that prior work has loss functions $L_1,L_2,L_3$, therefore let us propose a unified objective $\alpha_1L_1 + \alpha_2 L_2 + \alpha_3L_3$. In my eyes this proposal is incremental and has limited novelty. - The work is not solid and there are issues with writing. For example: - The first part of the paper (pages 1-5, until section 3.3), appears to be a review of prior w

Code & Models

Repositories

joey-wang123/cl-refresh-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProblem and Project Based Learning · Education and Critical Thinking Development · Innovative Teaching Methods