Libra R-CNN: Towards Balanced Learning for Object Detection
Jiangmiao Pang, Kai Chen, Jianping Shi, Huajun Feng, Wanli Ouyang,, Dahua Lin

TL;DR
Libra R-CNN introduces a balanced learning framework for object detection by addressing sample, feature, and objective level imbalances, leading to significant performance improvements over existing models.
Contribution
It proposes three novel components—IoU-balanced sampling, balanced feature pyramid, and balanced L1 loss—that collectively enhance detection accuracy.
Findings
Achieves 2.5 AP improvement over FPN Faster R-CNN.
Achieves 2.0 AP improvement over RetinaNet.
Effectively mitigates training imbalance issues.
Abstract
Compared with model architectures, the training process, which is also crucial to the success of detectors, has received relatively less attention in object detection. In this work, we carefully revisit the standard training practice of detectors, and find that the detection performance is often limited by the imbalance during the training process, which generally consists in three levels - sample level, feature level, and objective level. To mitigate the adverse effects caused thereby, we propose Libra R-CNN, a simple but effective framework towards balanced learning for object detection. It integrates three novel components: IoU-balanced sampling, balanced feature pyramid, and balanced L1 loss, respectively for reducing the imbalance at sample, feature, and objective level. Benefitted from the overall balanced design, Libra R-CNN significantly improves the detection performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Adversarial Robustness in Machine Learning
MethodsAverage Pooling · ResNeXt Block · Non-Local Operation · Embedded Gaussian Affinity · Non-Local Block · IoU-Balanced Sampling · Balanced L1 Loss · Balanced Feature Pyramid · Focal Loss · Feature Pyramid Network
