Complementary Calibration: Boosting General Continual Learning with Collaborative Distillation and Self-Supervision
Zhong Ji, Jin Li, Qiang Wang, Zhongfei Zhang

TL;DR
This paper introduces the CoCa framework that enhances general continual learning by addressing relation and feature deviations through collaborative distillation and self-supervision, leading to improved performance across multiple datasets.
Contribution
The paper proposes a novel Complementary Calibration framework combining collaborative distillation and self-supervision to mitigate key deviations in GCL, advancing continual learning methods.
Findings
Outperforms state-of-the-art methods on four datasets
Effectively maintains old task performance while learning new tasks
Addresses relation and feature deviations in GCL
Abstract
General Continual Learning (GCL) aims at learning from non independent and identically distributed stream data without catastrophic forgetting of the old tasks that don't rely on task boundaries during both training and testing stages. We reveal that the relation and feature deviations are crucial problems for catastrophic forgetting, in which relation deviation refers to the deficiency of the relationship among all classes in knowledge distillation, and feature deviation refers to indiscriminative feature representations. To this end, we propose a Complementary Calibration (CoCa) framework by mining the complementary model's outputs and features to alleviate the two deviations in the process of GCL. Specifically, we propose a new collaborative distillation approach for addressing the relation deviation. It distills model's outputs by utilizing ensemble dark knowledge of new model's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsContrastive Learning · Collaborative Distillation
