Towards Diverse Evaluation of Class Incremental Learning: A Representation Learning Perspective
Sungmin Cha, Jihwan Kwak, Dongsub Shim, Hyunwoo Kim, Moontae Lee,, Honglak Lee, and Taesup Moon

TL;DR
This paper critically examines class incremental learning algorithms from a representation learning perspective, revealing that high accuracy does not always correlate with meaningful representation updates, and advocates for more diverse evaluation protocols.
Contribution
It introduces new analysis methods for evaluating representations in CIL, highlighting the importance of representation quality and stability beyond accuracy metrics.
Findings
Most SOTA algorithms prioritize stability over representation change.
High accuracy can be achieved with lower-quality representations.
Representation quality varies across algorithms and affects final performance.
Abstract
Class incremental learning (CIL) algorithms aim to continually learn new object classes from incrementally arriving data while not forgetting past learned classes. The common evaluation protocol for CIL algorithms is to measure the average test accuracy across all classes learned so far -- however, we argue that solely focusing on maximizing the test accuracy may not necessarily lead to developing a CIL algorithm that also continually learns and updates the representations, which may be transferred to the downstream tasks. To that end, we experimentally analyze neural network models trained by CIL algorithms using various evaluation protocols in representation learning and propose new analysis methods. Our experiments show that most state-of-the-art algorithms prioritize high stability and do not significantly change the learned representation, and sometimes even learn a representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsTest · Balanced Selection
