VarifocalNet: An IoU-aware Dense Object Detector
Haoyang Zhang, Ying Wang, Feras Dayoub, Niko S\"underhauf

TL;DR
VarifocalNet introduces an IoU-aware classification score and a new loss function to improve dense object detection accuracy, achieving state-of-the-art results on MS COCO by better ranking candidate detections.
Contribution
The paper proposes a novel IoU-aware classification score, Varifocal Loss, and a star-shaped bounding box feature for improved dense object detection.
Findings
VFNet surpasses baseline by ~2.0 AP on MS COCO
Achieves 55.1 AP on COCO test-dev with Res2Net-101-DCN
State-of-the-art performance among object detectors
Abstract
Accurately ranking the vast number of candidate detections is crucial for dense object detectors to achieve high performance. Prior work uses the classification score or a combination of classification and predicted localization scores to rank candidates. However, neither option results in a reliable ranking, thus degrading detection performance. In this paper, we propose to learn an Iou-aware Classification Score (IACS) as a joint representation of object presence confidence and localization accuracy. We show that dense object detectors can achieve a more accurate ranking of candidate detections based on the IACS. We design a new loss function, named Varifocal Loss, to train a dense object detector to predict the IACS, and propose a new star-shaped bounding box feature representation for IACS prediction and bounding box refinement. Combining these two new components and a bounding box…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsVarifocal Loss · VarifocalNet · Convolution · 1x1 Convolution · Feature Pyramid Network · Non Maximum Suppression · FCOS
