On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning
Pengfei Fang, Yongchun Qin, Hui Xue

TL;DR
This paper introduces a novel knowledge distillation method called Displacement Knowledge Distillation (DKD) that enhances few-shot class-incremental learning by incorporating displacement information, and proposes a dual network architecture to improve performance on evolving data distributions.
Contribution
The paper presents DKD, a new distillation technique using displacement instead of similarity, and DDNet, a dual network architecture for better handling base and novel classes in FSCIL.
Findings
DDNet achieves state-of-the-art results on three benchmarks.
DKD improves knowledge retention with displacement-based distillation.
The approach demonstrates robustness and general applicability.
Abstract
Few-shot Class-Incremental Learning (FSCIL) addresses the challenges of evolving data distributions and the difficulty of data acquisition in real-world scenarios. To counteract the catastrophic forgetting typically encountered in FSCIL, knowledge distillation is employed as a way to maintain the knowledge from learned data distribution. Recognizing the limitations of generating discriminative feature representations in a few-shot context, our approach incorporates structural information between samples into knowledge distillation. This structural information serves as a remedy for the low quality of features. Diverging from traditional structured distillation methods that compute sample similarity, we introduce the Displacement Knowledge Distillation (DKD) method. DKD utilizes displacement rather than similarity between samples, incorporating both distance and angular information to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsBalanced Selection · Knowledge Distillation
