RTMDet: An Empirical Study of Designing Real-Time Object Detectors
Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi, Liu, Shilong Zhang, Kai Chen

TL;DR
RTMDet is a new real-time object detector that outperforms existing models like YOLO in accuracy and speed, with versatile architecture suitable for various object recognition tasks, including segmentation and rotated detection.
Contribution
The paper introduces RTMDet, a highly efficient and extensible real-time object detector with a novel architecture using large-kernel depth-wise convolutions and improved training techniques.
Findings
Achieves 52.8% AP on COCO at 300+ FPS.
Outperforms mainstream industrial detectors in accuracy and speed.
Sets new state-of-the-art in real-time instance segmentation and rotated detection.
Abstract
In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection. To obtain a more efficient model architecture, we explore an architecture that has compatible capacities in the backbone and neck, constructed by a basic building block that consists of large-kernel depth-wise convolutions. We further introduce soft labels when calculating matching costs in the dynamic label assignment to improve accuracy. Together with better training techniques, the resulting object detector, named RTMDet, achieves 52.8% AP on COCO with 300+ FPS on an NVIDIA 3090 GPU, outperforming the current mainstream industrial detectors. RTMDet achieves the best parameter-accuracy trade-off with tiny/small/medium/large/extra-large model sizes for various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗akore/rtmw-m-256x192model· 81 dl81 dl
- 🤗akore/rtmw-l-256x192model· 75 dl75 dl
- 🤗akore/rtmw-x-256x192model· 84 dl84 dl
- 🤗akore/rtmw-l-384x288model· 90 dl90 dl
- 🤗akore/rtmw-x-384x288model· 89 dl89 dl
- 🤗akore/rtmdet-lmodel· 64 dl64 dl
- 🤗akore/rtmdet-mmodel· 53 dl53 dl
- 🤗akore/rtmdet-smodel· 50 dl50 dl
- 🤗akore/rtmdet-tinymodel· 69 dl69 dl
- 🤗Ambarella/RTMDetmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
MethodsRTMDet: An Empirical Study of Designing Real-Time Object Detectors
