A Unified and General Framework for Continual Learning
Zhenyi Wang, Yan Li, Li Shen, Heng Huang

TL;DR
This paper introduces a comprehensive framework for Continual Learning that unifies existing methods, reveals their shared mathematical structures, and proposes a novel refresh learning technique inspired by neuroscience to improve knowledge retention.
Contribution
It presents a unified optimization framework for various CL methods and introduces refresh learning, a new concept that enhances continual learning by unlearning and relearning information.
Findings
Shared mathematical structures among diverse CL methods
Refresh learning improves CL performance
Framework effectively unifies existing approaches
Abstract
Continual Learning (CL) focuses on learning from dynamic and changing data distributions while retaining previously acquired knowledge. Various methods have been developed to address the challenge of catastrophic forgetting, including regularization-based, Bayesian-based, and memory-replay-based techniques. However, these methods lack a unified framework and common terminology for describing their approaches. This research aims to bridge this gap by introducing a comprehensive and overarching framework that encompasses and reconciles these existing methodologies. Notably, this new framework is capable of encompassing established CL approaches as special instances within a unified and general optimization objective. An intriguing finding is that despite their diverse origins, these methods share common mathematical structures. This observation highlights the compatibility of these…
Peer Reviews
Decision·ICLR 2024 poster
- The unified framework provided by the authors is interesting and sheds light on particular characteristics of existing algorithms. - The main idea behind the proposed "refresh learning" algorithm seems to be reasonable. - The authors integrated their "refresh learning" approach to existing algorithms showing empirical results on three different datasets.
- The first part of the paper (unified framework) seems to be unrelated to the second one (Refresh learning algorithm). - The refresh learning objective eqs (12), (13) is insufficiently motivated. - It's not clear why unlearning should be enforced on the current batch. The authors do not provide any motivation. - The conversion of the constraint to a PDE is not obvious to me and is not sufficiently explained in the paper. - The refresh learning algorithm ends up in a preconditioned ascent foll
The paper presents a unified framework for different continual learning methods, and derived a new CL approach from the unified objective. The paper presents detailed theoretical derivations of the algorithm and comprehensive experimental results to demonstrate the advantage of the new method of refresh learning.
For me the more interesting and novel part of the paper is the refresh learning method, its derivation, intuition behind it, and its performance, while the part where the unified approach corresponds to different CL methods in different setup is more expected and easier to follow. I would recommend the authors shorten the part of how the unified objective corresponds to different special cases and further elaborate on refresh learning.
Omitted.
In my opinion, the paper has the following weaknesses: - The proposed framework is not very interesting. Basically, it says that prior work has loss functions $L_1,L_2,L_3$, therefore let us propose a unified objective $\alpha_1L_1 + \alpha_2 L_2 + \alpha_3L_3$. In my eyes this proposal is incremental and has limited novelty. - The work is not solid and there are issues with writing. For example: - The first part of the paper (pages 1-5, until section 3.3), appears to be a review of prior w
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProblem and Project Based Learning · Education and Critical Thinking Development · Innovative Teaching Methods
