A Faster Path to Continual Learning

Wei Li; Hangjie Yuan; Zixiang Zhao; Borui Kang; Ziwei Liu; Tao Feng

arXiv:2604.11064·cs.LG·April 16, 2026

A Faster Path to Continual Learning

Wei Li, Hangjie Yuan, Zixiang Zhao, Borui Kang, Ziwei Liu, Tao Feng

PDF

TL;DR

This paper introduces C-Flat Turbo, an optimized version of C-Flat for continual learning that reduces training overhead by skipping redundant gradient computations, achieving faster training with comparable or better accuracy.

Contribution

It proposes a novel gradient skipping technique and adaptive scheduling to significantly accelerate C-Flat without sacrificing performance in continual learning.

Findings

01

C-Flat Turbo is 1.0× to 1.25× faster than C-Flat.

02

It maintains or improves accuracy across various continual learning methods.

03

The approach reduces gradient computation overhead in the optimization process.

Abstract

Continual Learning (CL) aims to train neural networks on a dynamic stream of tasks without forgetting previously learned knowledge. Among optimization-based approaches, C-Flat has emerged as a promising solution due to its plug-and-play nature and its ability to encourage uniformly low-loss regions for both new and old tasks. However, C-Flat requires three additional gradient computations per iteration, imposing substantial overhead on the optimization process. In this work, we propose C-Flat Turbo, a faster yet stronger optimizer that significantly reduces the training cost. We show that the gradients associated with first-order flatness contain direction-invariant components relative to the proxy-model gradients, enabling us to skip redundant gradient computations in the perturbed ascent steps. Moreover, we observe that these flatness-promoting gradients progressively stabilize across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.