An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning
Gr\'egoire Petit, Michael Soumm, Eva Feillet, Adrian Popescu, Bertrand, Delezoide, David Picard, C\'eline Hudelot

TL;DR
This paper investigates how initial training strategies, including pre-training and first-batch training, impact class-incremental learning performance, providing a comprehensive analysis and practical recommendations for deployment.
Contribution
It offers the first in-depth statistical analysis of initial training strategies' effects on CIL, highlighting their dominance over other factors in incremental accuracy.
Findings
Initial training strategy significantly affects incremental accuracy.
Choice of CIL algorithm mainly influences forgetting.
Pre-trained models can enhance initial performance.
Abstract
Class-Incremental Learning (CIL) aims to build classification models from data streams. At each step of the CIL process, new classes must be integrated into the model. Due to catastrophic forgetting, CIL is particularly challenging when examples from past classes cannot be stored, the case on which we focus here. To date, most approaches are based exclusively on the target dataset of the CIL process. However, the use of models pre-trained in a self-supervised way on large amounts of data has recently gained momentum. The initial model of the CIL process may only use the first batch of the target dataset, or also use pre-trained weights obtained on an auxiliary dataset. The choice between these two initial learning strategies can significantly influence the performance of the incremental learning model, but has not yet been studied in depth. Performance is also influenced by the choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Data Stream Mining Techniques
MethodsFocus
