PP-YOLO: An Effective and Efficient Implementation of Object Detector

Xiang Long; Kaipeng Deng; Guanzhong Wang; Yang Zhang; Qingqing Dang,; Yuan Gao; Hui Shen; Jianguo Ren; Shumin Han; Errui Ding; Shilei Wen

arXiv:2007.12099·cs.CV·August 4, 2020·234 cites

PP-YOLO: An Effective and Efficient Implementation of Object Detector

Xiang Long, Kaipeng Deng, Guanzhong Wang, Yang Zhang, Qingqing Dang,, Yuan Gao, Hui Shen, Jianguo Ren, Shumin Han, Errui Ding, Shilei Wen

PDF

Open Access 5 Repos

TL;DR

PP-YOLO is an improved version of YOLOv3 that combines multiple optimization tricks to enhance accuracy while maintaining real-time speed, making it suitable for practical applications.

Contribution

The paper introduces PP-YOLO, a practical object detector based on YOLOv3 that improves accuracy without increasing model complexity, using various existing tricks.

Findings

01

Achieves 45.2% mAP, surpassing state-of-the-art detectors.

02

Maintains high inference speed at 72.9 FPS.

03

Effectively balances accuracy and efficiency for real-world use.

Abstract

Object detection is one of the most important areas in computer vision, which plays a key role in various practical scenarios. Due to limitation of hardware, it is often necessary to sacrifice accuracy to ensure the infer speed of the detector in practice. Therefore, the balance between effectiveness and efficiency of object detector must be considered. The goal of this paper is to implement an object detector with relatively balanced effectiveness and efficiency that can be directly applied in actual application scenarios, rather than propose a novel detection model. Considering that YOLOv3 has been widely used in practice, we develop a new object detector based on YOLOv3. We mainly try to combine various existing tricks that almost not increase the number of model parameters and FLOPs, to achieve the goal of improving the accuracy of detector as much as possible while ensuring that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · COVID-19 diagnosis using AI

MethodsBitcoin Customer Service Number +1-833-534-1729 · Residual Block · Kaiming Initialization · Bottleneck Residual Block · Deformable Convolution · CoordConv · Matrix Non-Maximum Suppression · Grid Sensitive · PP-YOLO · Residual Connection