Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression
Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, and Dongwei Ren

TL;DR
This paper introduces Distance-IoU (DIoU) and Complete IoU (CIoU) losses that improve bounding box regression speed and accuracy in object detection by incorporating geometric factors, outperforming existing IoU-based losses.
Contribution
The paper proposes novel DIoU and CIoU loss functions that accelerate convergence and enhance bounding box regression accuracy in object detection models.
Findings
DIoU loss converges faster than IoU and GIoU losses.
CIoU loss incorporates aspect ratio and central point distance for better performance.
Integrating DIoU and CIoU into existing detectors improves IoU and GIoU metrics.
Abstract
Bounding box regression is the crucial step in object detection. In existing methods, while -norm loss is widely adopted for bounding box regression, it is not tailored to the evaluation metric, i.e., Intersection over Union (IoU). Recently, IoU loss and generalized IoU (GIoU) loss have been proposed to benefit the IoU metric, but still suffer from the problems of slow convergence and inaccurate regression. In this paper, we propose a Distance-IoU (DIoU) loss by incorporating the normalized distance between the predicted box and the target box, which converges much faster in training than IoU and GIoU losses. Furthermore, this paper summarizes three geometric factors in bounding box regression, \ie, overlap area, central point distance and aspect ratio, based on which a Complete IoU (CIoU) loss is proposed, thereby leading to faster convergence and better performance. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsConvolution · DIoU-NMS · Non Maximum Suppression · 1x1 Convolution · SSD
