Dual-Teacher Class-Incremental Learning With Data-Free Generative Replay

Yoojin Choi; Mostafa El-Khamy; Jungwon Lee

arXiv:2106.09835·cs.CV·June 21, 2021

Dual-Teacher Class-Incremental Learning With Data-Free Generative Replay

Yoojin Choi, Mostafa El-Khamy, Jungwon Lee

PDF

TL;DR

This paper introduces data-free generative replay and dual-teacher knowledge distillation techniques to improve class-incremental learning, reducing reliance on pre-trained generative models and enhancing knowledge transfer, demonstrated on CIFAR-100 and ImageNet.

Contribution

It presents novel methods for data-free generative replay and dual-teacher distillation, advancing class-incremental learning without pre-trained generative models.

Findings

01

Improved accuracy on CIFAR-100 and ImageNet datasets.

02

Reduced memory and training costs compared to traditional methods.

03

Enhanced knowledge transfer in incremental learning scenarios.

Abstract

This paper proposes two novel knowledge transfer techniques for class-incremental learning (CIL). First, we propose data-free generative replay (DF-GR) to mitigate catastrophic forgetting in CIL by using synthetic samples from a generative model. In the conventional generative replay, the generative model is pre-trained for old data and shared in extra memory for later incremental learning. In our proposed DF-GR, we train a generative model from scratch without using any training data, based on the pre-trained classification model from the past, so we curtail the cost of sharing pre-trained generative models. Second, we introduce dual-teacher information distillation (DT-ID) for knowledge distillation from two teachers to one student. In CIL, we use DT-ID to learn new classes incrementally based on the pre-trained model for old classes and another model (pre-)trained on the new data for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation