SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector
Shuailei Ma, Yuefeng Wang, Ying Wei, Jiaqi Fan, Enming Zhang, Xinyu, Sun, Peihao Chen

TL;DR
This paper introduces SKDF, a simple knowledge distillation framework that effectively transfers open-world knowledge to object detectors, improving unknown object detection while mitigating catastrophic forgetting through novel loss and decoding structures.
Contribution
The paper proposes a novel knowledge distillation method with a down-weight loss and a cascade decouple decoding structure for open-world object detection.
Findings
Effective in detecting unknown objects in open-world scenarios.
Reduces catastrophic forgetting of known objects during distillation.
Outperforms existing methods on OWOD and new benchmarks.
Abstract
In this paper, we attempt to specialize the VLM model for OWOD tasks by distilling its open-world knowledge into a language-agnostic detector. Surprisingly, we observe that the combination of a simple \textbf{knowledge distillation} approach and the automatic pseudo-labeling mechanism in OWOD can achieve better performance for unknown object detection, even with a small amount of data. Unfortunately, knowledge distillation for unknown objects severely affects the learning of detectors with conventional structures for known objects, leading to catastrophic forgetting. To alleviate these problems, we propose the \textbf{down-weight loss function} for knowledge distillation from vision-language to single vision modality. Meanwhile, we propose the \textbf{cascade decouple decoding structure} that decouples the learning of localization and recognition to reduce the impact of category…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
MethodsKnowledge Distillation
