Generalized Focal Loss V2: Learning Reliable Localization Quality   Estimation for Dense Object Detection

Xiang Li; Wenhai Wang; Xiaolin Hu; Jun Li; Jinhui Tang; Jian Yang

arXiv:2011.12885·cs.CV·November 26, 2020·25 cites

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection

Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang

PDF

Open Access 5 Repos

TL;DR

GFLV2 introduces a novel approach to localization quality estimation in dense object detection by leveraging bounding box distribution statistics, resulting in improved accuracy without sacrificing efficiency.

Contribution

It pioneers the use of bounding box distribution statistics for LQE, enhancing detection performance with a lightweight, statistically grounded predictor.

Findings

01

Achieves 46.2 AP on COCO test-dev with ResNet-101.

02

Surpasses previous state-of-the-art by 2.6 AP.

03

Maintains real-time inference speed at 14.6 FPS.

Abstract

Localization Quality Estimation (LQE) is crucial and popular in the recent advancement of dense object detectors since it can provide accurate ranking scores that benefit the Non-Maximum Suppression processing and improve detection performance. As a common practice, most existing methods predict LQE scores through vanilla convolutional features shared with object classification or bounding box regression. In this paper, we explore a completely novel and different perspective to perform LQE -- based on the learned distributions of the four parameters of the bounding box. The bounding box distributions are inspired and introduced as "General Distribution" in GFLV1, which describes the uncertainty of the predicted bounding boxes well. Such a property makes the distribution statistics of a bounding box highly correlated to its real localization quality. Specifically, a bounding box…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI

MethodsAdaptive Training Sample Selection