Compressed Object Detection
Gedeon Muhawenayo, Georgia Gkioxari

TL;DR
This paper presents a model compression method for object detection that reduces model size by 30% using pruning and weight sharing, enabling deployment on resource-constrained devices without performance loss.
Contribution
It extends pruning and weight sharing techniques specifically for object detection models, achieving significant compression while maintaining accuracy.
Findings
30% model size reduction without performance loss
Compressed models can be initialized with existing pre-trained weights
Effective for deployment on mobile and IoT devices
Abstract
Deep learning approaches have achieved unprecedented performance in visual recognition tasks such as object detection and pose estimation. However, state-of-the-art models have millions of parameters represented as floats which make them computationally expensive and constrain their deployment on hardware such as mobile phones and IoT nodes. Most commonly, activations of deep neural networks tend to be sparse thus proving that models are over parametrized with redundant neurons. Model compression techniques, such as pruning and quantization, have recently shown promising results by improving model complexity with little loss in performance. In this work, we extended pruning, a compression technique that discards unnecessary model connections, and weight sharing techniques for the task of object detection. With our approach, we are able to compress a state-of-the-art object detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsPruning
