ThunderNet: Towards Real-time Generic Object Detection
Zheng Qin, Zeming Li, Zhaoning Zhang, Yiping Bao, Gang Yu, Yuxing, Peng, Jian Sun

TL;DR
ThunderNet is a lightweight, efficient two-stage object detector designed for real-time performance on mobile devices, achieving high accuracy with low computational cost and running at 24.1 fps on ARM platforms.
Contribution
The paper introduces ThunderNet, a novel lightweight two-stage detector with a specialized backbone and detection modules optimized for real-time mobile detection.
Findings
Achieves superior performance with only 40% of the computational cost compared to other lightweight detectors.
Runs at 24.1 fps on ARM-based devices, demonstrating real-time capability.
First reported real-time detector on ARM platforms.
Abstract
Real-time generic object detection on mobile platforms is a crucial but challenging computer vision task. However, previous CNN-based detectors suffer from enormous computational cost, which hinders them from real-time inference in computation-constrained scenarios. In this paper, we investigate the effectiveness of two-stage detectors in real-time generic detection and propose a lightweight two-stage detector named ThunderNet. In the backbone part, we analyze the drawbacks in previous lightweight backbones and present a lightweight backbone designed for object detection. In the detection part, we exploit an extremely efficient RPN and detection head design. To generate more discriminative feature representation, we design two efficient architecture blocks, Context Enhancement Module and Spatial Attention Module. At last, we investigate the balance between the input resolution, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Position-Sensitive RoIAlign · Channel Shuffle · ShuffleNet V2 Block · Global Average Pooling · 1x1 Convolution · Batch Normalization · Depthwise Convolution · Dense Connections · Softmax
