An Effective Dynamic Gradient Calibration Method for Continual Learning

Weichen Lin; Jiaxiang Chen; Ruomin Huang; Hu Ding

arXiv:2407.20956·cs.LG·July 31, 2024

TL;DR

This paper introduces a dynamic gradient calibration method for continual learning. Inspired by variance reduction techniques, it adjusts the gradient at each training step to mitigate catastrophic forgetting, and it can be combined with existing continual learning methods to improve them.

Contribution

The paper proposes a novel gradient calibration algorithm for continual learning, inspired by variance reduction, that improves performance and can be integrated with existing methods.
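
For readers unfamiliar with variance reduction, the following is a minimal sketch of the classic SVRG-style gradient correction on a toy least-squares problem. It is not the paper's algorithm: the function names (`calibrated_gradient`, `least_squares_grad`), the synthetic data, and the hyperparameters are all assumptions chosen for illustration. It also assumes the full dataset is available to compute the snapshot gradient, which is exactly what continual learning cannot rely on once historical data has been discarded.

```python
import numpy as np

def least_squares_grad(w, X, y):
    """Gradient of the mean squared error 0.5 * mean((X @ w - y)**2)."""
    return X.T @ (X @ w - y) / len(y)

def calibrated_gradient(w, w_snapshot, full_grad_snapshot, X_batch, y_batch):
    """SVRG-style correction (illustrative, not the paper's method).

    The noisy mini-batch gradient at w is corrected by the difference
    between the full gradient and the mini-batch gradient, both evaluated
    at an earlier snapshot of the weights.
    """
    g_now = least_squares_grad(w, X_batch, y_batch)
    g_old = least_squares_grad(w_snapshot, X_batch, y_batch)
    return g_now - g_old + full_grad_snapshot

# Hypothetical toy problem: noiseless linear regression.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
w_true = rng.normal(size=5)
y = X @ w_true

w = np.zeros(5)
learning_rate = 0.05
for epoch in range(30):
    # Take a snapshot and compute the full gradient there once per epoch.
    w_snapshot = w.copy()
    full_grad = least_squares_grad(w_snapshot, X, y)
    for _ in range(100):
        idx = rng.integers(0, len(y), size=8)
        g = calibrated_gradient(w, w_snapshot, full_grad, X[idx], y[idx])
        w -= learning_rate * g

final_error = float(np.linalg.norm(w - w_true))
```

In this sketch the corrected gradient stays unbiased while its variance shrinks as `w` approaches the snapshot, which is the general property variance reduction methods exploit.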

Findings

01 Improved performance on benchmark datasets

02 Effective reduction of catastrophic forgetting

03 Compatible with multiple existing CL methods

Abstract

Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Because of memory limits, we cannot store all the historical data, and therefore confront the "catastrophic forgetting" problem, i.e., the performance on previous tasks can decrease substantially because of the information missing in the later period. Though a number of elegant methods have been proposed, the catastrophic forgetting phenomenon still cannot be well avoided in practice. In this paper, we study the problem from the gradient perspective, where our aim is to develop an effective algorithm to calibrate the gradient in each updating step of the model; namely, our goal is to guide the model to be updated in the right direction when a large amount of historical data is unavailable. Our idea is partly inspired…

Taxonomy

Topics: Image Processing Techniques and Applications

Methods: Sparse Evolutionary Training