Architectural Insights into Knowledge Distillation for Object Detection: A Comprehensive Review
Mahdi Golizadeh, Nassibeh Golizadeh, Mohammad Ali Keyvanrad, Hossein Shirazi

TL;DR
This comprehensive review categorizes knowledge distillation methods for object detection, analyzing their effectiveness on benchmark datasets, and highlights challenges and future directions for scalable detection systems.
Contribution
Introduces a novel architecture-centric taxonomy for KD methods in object detection, covering CNN and Transformer-based models, with a comparative analysis on benchmark datasets.
Findings
Taxonomy clarifies KD approaches for object detection
Evaluation on MS COCO and PASCAL VOC datasets
Highlights challenges and future research directions
Abstract
Object detection has achieved remarkable accuracy through deep learning, yet these improvements often come with increased computational cost, limiting deployment on resource-constrained devices. Knowledge Distillation (KD) provides an effective solution by enabling compact student models to learn from larger teacher models. However, adapting KD to object detection poses unique challenges due to its dual objectives-classification and localization-as well as foreground-background imbalance and multi-scale feature representation. This review introduces a novel architecture-centric taxonomy for KD methods, distinguishing between CNN-based detectors (covering backbone-level, neck-level, head-level, and RPN/RoI-level distillation) and Transformer-based detectors (including query-level, feature-level, and logit-level distillation). We further evaluate representative methods using the MS COCO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
