Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection
Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui, Tang, Jian Yang

TL;DR
This paper introduces Generalized Focal Loss (GFL), a novel loss function for dense object detection that unifies quality estimation, classification, and localization into a continuous, flexible framework, improving detection accuracy.
Contribution
It proposes a new representation for quality and localization that addresses inconsistencies and inflexibility in existing methods, and develops GFL to optimize these continuous labels effectively.
Findings
GFL achieves 45.0% AP on COCO with ResNet-101, outperforming state-of-the-art methods.
The model attains 48.2% AP at 10 FPS on a single GPU, demonstrating high accuracy and efficiency.
The approach unifies quality, classification, and localization into a continuous framework, improving detection performance.
Abstract
One-stage detector basically formulates object detection as dense classification and localization. The classification is usually optimized by Focal Loss and the box location is commonly learned under Dirac delta distribution. A recent trend for one-stage detectors is to introduce an individual prediction branch to estimate the quality of localization, where the predicted quality facilitates the classification to improve detection performance. This paper delves into the representations of the above three fundamental elements: quality estimation, classification and localization. Two problems are discovered in existing practices, including (1) the inconsistent usage of the quality estimation and classification between training and inference and (2) the inflexible Dirac delta distribution for localization when there is ambiguity and uncertainty in complex scenes. To address the problems, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsGeneralized Focal Loss · Average Pooling · Deformable Convolution · ResNeXt Block · Grouped Convolution · Kaiming Initialization · Global Average Pooling · Residual Connection · 1x1 Convolution · Convolution
