GradMix: Gradient-based Selective Mixup for Robust Data Augmentation in Class-Incremental Learning

Minsu Kim; Seong-Hyeon Hwang; Steven Euijong Whang

arXiv:2505.08528·cs.LG·December 24, 2025

GradMix: Gradient-based Selective Mixup for Robust Data Augmentation in Class-Incremental Learning

Minsu Kim, Seong-Hyeon Hwang, Steven Euijong Whang

PDF

TL;DR

GradMix is a novel gradient-based selective mixup data augmentation technique designed to reduce catastrophic forgetting in class-incremental learning, outperforming existing methods by intelligently mixing helpful class pairs.

Contribution

It introduces a class-based criterion for selective sample mixing, effectively mitigating knowledge loss during continual learning.

Findings

01

GradMix outperforms baseline data augmentation methods in accuracy.

02

It reduces catastrophic forgetting in class-incremental learning.

03

The method is validated on various real datasets.

Abstract

In the context of continual learning, acquiring new knowledge while maintaining previous knowledge presents a significant challenge. Existing methods often use experience replay techniques that store a small portion of previous task data for training. In experience replay approaches, data augmentation has emerged as a promising strategy to further improve the model performance by mixing limited previous task data with sufficient current task data. However, we theoretically and empirically analyze that training with mixed samples from random sample pairs may harm the knowledge of previous tasks and cause greater catastrophic forgetting. We then propose GradMix, a robust data augmentation method specifically designed for mitigating catastrophic forgetting in class-incremental learning. GradMix performs gradient-based selective mixup using a class-based criterion that mixes only samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMixup · Experience Replay