DAMO-YOLO : A Report on Real-Time Object Detection Design
Xianzhe Xu, Yiqi Jiang, Weihua Chen, Yilun Huang, Yuan Zhang, Xiuyu, Sun

TL;DR
DAMO-YOLO is a new real-time object detection method that improves accuracy and speed over previous YOLO models by integrating NAS, reparameterized FPN, and label assignment enhancements, suitable for various devices.
Contribution
The paper introduces DAMO-YOLO, a novel object detection framework with innovative architecture search, efficient neck design, and label assignment, achieving state-of-the-art performance across multiple scales.
Findings
Achieves up to 51.9 mAP on COCO with 7.95 ms latency on T4 GPUs.
Outperforms previous YOLO models in accuracy and speed.
Provides lightweight models for edge devices with competitive performance.
Abstract
In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series. DAMO-YOLO is extended from YOLO with some new technologies, including Neural Architecture Search (NAS), efficient Reparameterized Generalized-FPN (RepGFPN), a lightweight head with AlignedOTA label assignment, and distillation enhancement. In particular, we use MAE-NAS, a method guided by the principle of maximum entropy, to search our detection backbone under the constraints of low latency and high performance, producing ResNet/CSP-like structures with spatial pyramid pooling and focus modules. In the design of necks and heads, we follow the rule of ``large neck, small head''.We import Generalized-FPN with accelerated queen-fusion to build the detector neck and upgrade its CSPNet with efficient layer aggregation networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Image and Object Detection Techniques
MethodsDAMO-YOLO · Spatial Pyramid Pooling
