UnitBox: An Advanced Object Detection Network

Jiahui Yu; Yuning Jiang; Zhangyang Wang; Zhimin Cao; Thomas Huang

arXiv:1608.01471·cs.CV·August 5, 2016

UnitBox: An Advanced Object Detection Network

Jiahui Yu, Yuning Jiang, Zhangyang Wang, Zhimin Cao, Thomas Huang

PDF

TL;DR

UnitBox introduces an IoU-based loss function and a fully convolutional network architecture for improved object detection, achieving state-of-the-art results in face detection with fast convergence and robustness to object scale and shape variations.

Contribution

The paper proposes a novel IoU loss function integrated into a deep fully convolutional network for more accurate bounding box regression in object detection.

Findings

01

Achieved top performance on FDDB face detection benchmark.

02

Demonstrated robustness to object scale and shape variations.

03

Converges faster than previous methods.

Abstract

In present object detection systems, the deep convolutional neural networks (CNNs) are utilized to predict bounding boxes of object candidates, and have gained performance advantages over the traditional region proposal methods. However, existing deep CNN methods assume the object bounds to be four independent variables, which could be regressed by the $ℓ_{2}$ loss separately. Such an oversimplified assumption is contrary to the well-received observation, that those variables are correlated, resulting to less accurate localization. To address the issue, we firstly introduce a novel Intersection over Union ( $I o U$ ) loss function for bounding box prediction, which regresses the four bounds of a predicted box as a whole unit. By taking the advantages of $I o U$ loss and deep fully convolutional networks, the UnitBox is introduced, which performs accurate and efficient localization, shows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.