PP-YOLO: An Effective and Efficient Implementation of Object Detector
Xiang Long, Kaipeng Deng, Guanzhong Wang, Yang Zhang, Qingqing Dang,, Yuan Gao, Hui Shen, Jianguo Ren, Shumin Han, Errui Ding, Shilei Wen

TL;DR
PP-YOLO is an improved version of YOLOv3 that combines multiple optimization tricks to enhance accuracy while maintaining real-time speed, making it suitable for practical applications.
Contribution
The paper introduces PP-YOLO, a practical object detector based on YOLOv3 that improves accuracy without increasing model complexity, using various existing tricks.
Findings
Achieves 45.2% mAP, surpassing state-of-the-art detectors.
Maintains high inference speed at 72.9 FPS.
Effectively balances accuracy and efficiency for real-world use.
Abstract
Object detection is one of the most important areas in computer vision, which plays a key role in various practical scenarios. Due to limitation of hardware, it is often necessary to sacrifice accuracy to ensure the infer speed of the detector in practice. Therefore, the balance between effectiveness and efficiency of object detector must be considered. The goal of this paper is to implement an object detector with relatively balanced effectiveness and efficiency that can be directly applied in actual application scenarios, rather than propose a novel detection model. Considering that YOLOv3 has been widely used in practice, we develop a new object detector based on YOLOv3. We mainly try to combine various existing tricks that almost not increase the number of model parameters and FLOPs, to achieve the goal of improving the accuracy of detector as much as possible while ensuring that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · COVID-19 diagnosis using AI
MethodsBitcoin Customer Service Number +1-833-534-1729 · Residual Block · Kaiming Initialization · Bottleneck Residual Block · Deformable Convolution · CoordConv · Matrix Non-Maximum Suppression · Grid Sensitive · PP-YOLO · Residual Connection
