An Effective Dynamic Gradient Calibration Method for Continual Learning
Weichen Lin, Jiaxiang Chen, Ruomin Huang, Hu Ding

TL;DR
This paper introduces a dynamic gradient calibration method for continual learning that aims to mitigate catastrophic forgetting by adjusting gradients during training, inspired by variance reduction techniques, and can enhance existing methods.
Contribution
The paper proposes a novel gradient calibration algorithm for continual learning, inspired by variance reduction, that improves performance and can be integrated with existing methods.
Findings
Improved performance on benchmark datasets
Effective reduction of catastrophic forgetting
Compatible with multiple existing CL methods
Abstract
Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Due to the memory limit, we cannot store all the historical data, and therefore confront the ``catastrophic forgetting'' problem, i.e., the performance on the previous tasks can substantially decrease because of the missing information in the latter period. Though a number of elegant methods have been proposed, the catastrophic forgetting phenomenon still cannot be well avoided in practice. In this paper, we study the problem from the gradient perspective, where our aim is to develop an effective algorithm to calibrate the gradient in each updating step of the model; namely, our goal is to guide the model to be updated in the right direction under the situation that a large amount of historical data are unavailable. Our idea is partly inspired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications
MethodsSparse Evolutionary Training
