In Teacher We Trust: Learning Compressed Models for Pedestrian Detection
Jonathan Shen, Noranart Vesdapunt, Vishnu N. Boddeti, Kris M. Kitani

TL;DR
This paper introduces a novel knowledge distillation approach with a hint layer and uncertainty modeling to effectively train small pedestrian detection models, achieving high accuracy with significantly fewer parameters.
Contribution
It proposes a new method combining a hint layer, uncertainty estimation, and hand-designed features to improve small model training for pedestrian detection.
Findings
Small model with 400x fewer parameters outperforms AlexNet on Caltech dataset.
Enhanced distillation method improves small model accuracy.
Incorporating uncertainty and features boosts pedestrian detection performance.
Abstract
Deep convolutional neural networks continue to advance the state-of-the-art in many domains as they grow bigger and more complex. It has been observed that many of the parameters of a large network are redundant, allowing for the possibility of learning a smaller network that mimics the outputs of the large network through a process called Knowledge Distillation. We show, however, that standard Knowledge Distillation is not effective for learning small models for the task of pedestrian detection. To improve this process, we introduce a higher-dimensional hint layer to increase information flow. We also estimate the variance in the outputs of the large network and propose a loss function to incorporate this uncertainty. Finally, we attempt to boost the complexity of the small network without increasing its size by using as input hand-designed features that have been demonstrated to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
MethodsKnowledge Distillation · 1x1 Convolution · Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax
