Rebalancing Batch Normalization for Exemplar-based Class-Incremental Learning
Sungmin Cha, Sungjun Cho, Dasol Hwang, Sunwon Hong, Moontae Lee, and, Taesup Moon

TL;DR
This paper introduces Task-Balanced Batch Normalization (TBBN), a new method designed to mitigate data imbalance in exemplar-based class-incremental learning, improving model retention across multiple datasets.
Contribution
We propose TBBN, a hyperparameter-free BN variant that addresses data imbalance in offline CIL, outperforming existing BN variants in various datasets.
Findings
TBBN outperforms other BN variants on CIFAR-100, ImageNet-100, and five diverse datasets.
TBBN is compatible with existing exemplar-based CIL algorithms and improves their performance.
TBBN maintains identical inference behavior to vanilla BN, ensuring ease of integration.
Abstract
Batch Normalization (BN) and its variants has been extensively studied for neural nets in various computer vision tasks, but relatively little work has been dedicated to studying the effect of BN in continual learning. To that end, we develop a new update patch for BN, particularly tailored for the exemplar-based class-incremental learning (CIL). The main issue of BN in CIL is the imbalance of training data between current and past tasks in a mini-batch, which makes the empirical mean and variance as well as the learnable affine transformation parameters of BN heavily biased toward the current task -- contributing to the forgetting of past tasks. While one of the recent BN variants has been developed for "online" CIL, in which the training is done with a single epoch, we show that their method does not necessarily bring gains for "offline" CIL, in which a model is trained with multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsBatch Normalization
