Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning
Bingchen Huang, Zhineng Chen, Peng Zhou, Jiayin Chen, Zuxuan Wu

TL;DR
This paper addresses task confusion in dynamic expansion architectures for class incremental learning, proposing a novel method called TCIL that improves discrimination and fairness across tasks, leading to state-of-the-art results.
Contribution
The paper introduces TCIL, a multi-level knowledge distillation approach with attention and re-scoring, effectively reducing task confusion and catastrophic forgetting without rehearsal memory.
Findings
TCIL achieves state-of-the-art accuracy on CIFAR100 and ImageNet100.
TCIL effectively mitigates inter-task and old-new confusion.
TCIL outperforms existing methods in class incremental learning scenarios.
Abstract
The dynamic expansion architecture is becoming popular in class incremental learning, mainly due to its advantages in alleviating catastrophic forgetting. However, task confusion is not well assessed within this framework, e.g., the discrepancy between classes of different tasks is not well learned (i.e., inter-task confusion, ITC), and certain priority is still given to the latest class batch (i.e., old-new confusion, ONC). We empirically validate the side effects of the two types of confusion. Meanwhile, a novel solution called Task Correlated Incremental Learning (TCIL) is proposed to encourage discriminative and fair feature utilization across tasks. TCIL performs a multi-level knowledge distillation to propagate knowledge learned from old tasks to the new one. It establishes information flow paths at both feature and logit levels, enabling the learning to be aware of old classes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
MethodsKnowledge Distillation · Attentive Walk-Aggregating Graph Neural Network
