A systematic study of the foreground-background imbalance problem in deep learning for object detection
Hanxue Gu, Haoyu Dong, Nicholas Konz, Maciej A. Mazurowski

TL;DR
This paper systematically analyzes the foreground-background imbalance problem in object detection, revealing its impact on performance and comparing nine methods across diverse datasets to identify the most effective solutions.
Contribution
It provides a comprehensive analysis of F-B imbalance effects and evaluates multiple methods, highlighting Libra-RCNN and PISA as top performers.
Findings
F-B imbalance significantly reduces detection accuracy.
Performance drops more with smaller objects than fewer objects.
Libra-RCNN and PISA outperform other methods in addressing F-B imbalance.
Abstract
The class imbalance problem in deep learning has been explored in several studies, but there has yet to be a systematic analysis of this phenomenon in object detection. Here, we present comprehensive analyses and experiments of the foreground-background (F-B) imbalance problem in object detection, which is very common and caused by small, infrequent objects of interest. We experimentally study the effects of different aspects of F-B imbalance (object size, number of objects, dataset size, object type) on detection performance. In addition, we also compare 9 leading methods for addressing this problem, including Faster-RCNN, SSD, OHEM, Libra-RCNN, Focal-Loss, GHM, PISA, YOLO-v3, and GFL with a range of datasets from different imaging domains. We conclude that (1) the F-B imbalance can indeed cause a significant drop in detection performance, (2) The detection performance is more affected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsConvolution · 1x1 Convolution · Non Maximum Suppression · SSD · PrIme Sample Attention · Online Hard Example Mining
