Localization Distillation for Dense Object Detection
Zhaohui Zheng, Rongguang Ye, Ping Wang, Dongwei Ren and, Wangmeng Zuo, Qibin Hou, Ming-Ming Cheng

TL;DR
This paper introduces a novel localization distillation method for dense object detection that effectively transfers localization knowledge from teacher to student models, outperforming traditional feature imitation methods.
Contribution
It proposes a new localization distillation approach with valuable region selection, demonstrating superior performance over feature imitation in object detection tasks.
Findings
Localization distillation improves AP scores significantly.
Logit mimicking outperforms feature imitation.
Method is simple, effective, and applicable to various detectors.
Abstract
Knowledge distillation (KD) has witnessed its powerful capability in learning compact models in object detection. Previous KD methods for object detection mostly focus on imitating deep features within the imitation regions instead of mimicking classification logit due to its inefficiency in distilling localization information and trivial improvement. In this paper, by reformulating the knowledge distillation process on localization, we present a novel localization distillation (LD) method which can efficiently transfer the localization knowledge from the teacher to the student. Moreover, we also heuristically introduce the concept of valuable localization region that can aid to selectively distill the semantic and localization knowledge for a certain region. Combining these two new components, for the first time, we show that logit mimicking can outperform feature imitation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
