Forget Less, Retain More: A Lightweight Regularizer for Rehearsal-Based Continual Learning

Lama Alssum; Hasan Abed Al Kader Hammoud; Motasem Alfarra; Juan C Leon Alcazar; Bernard Ghanem

arXiv:2512.01818·cs.LG·December 2, 2025

Forget Less, Retain More: A Lightweight Regularizer for Rehearsal-Based Continual Learning

Lama Alssum, Hasan Abed Al Kader Hammoud, Motasem Alfarra, Juan C Leon Alcazar, Bernard Ghanem

PDF

Open Access

TL;DR

This paper introduces a lightweight, class-agnostic regularizer called Information Maximization (IM) that enhances rehearsal-based continual learning by reducing forgetting and improving convergence across diverse datasets, including video data.

Contribution

The paper proposes a novel, minimal-overhead regularizer for rehearsal-based continual learning that is class-agnostic and effective across multiple domains and data types.

Findings

01

Consistently improves baseline performance across datasets and tasks.

02

Reduces forgetting with minimal computational overhead.

03

Enhances video continual learning performance despite domain differences.

Abstract

Deep neural networks suffer from catastrophic forgetting, where performance on previous tasks degrades after training on a new task. This issue arises due to the model's tendency to overwrite previously acquired knowledge with new information. We present a novel approach to address this challenge, focusing on the intersection of memory-based methods and regularization approaches. We formulate a regularization strategy, termed Information Maximization (IM) regularizer, for memory-based continual learning methods, which is based exclusively on the expected label distribution, thus making it class-agnostic. As a consequence, IM regularizer can be directly integrated into various rehearsal-based continual learning methods, reducing forgetting and favoring faster convergence. Our empirical validation shows that, across datasets and regardless of the number of tasks, our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Visual Attention and Saliency Detection