Gradient Harmonized Single-stage Detector
Buyu Li, Yu Liu, Xiaogang Wang

TL;DR
This paper introduces Gradient Harmonized Mechanism (GHM), a novel approach to address training disharmonies in single-stage detectors by balancing gradient flows, leading to improved detection performance without extensive hyper-parameter tuning.
Contribution
The paper proposes GHM, a new gradient-based loss balancing method, with two specific loss functions GHM-C and GHM-R, enhancing single-stage detector accuracy.
Findings
Achieves 41.6 mAP on COCO test-dev set, surpassing previous methods.
Improves detection performance without extensive hyper-parameter tuning.
Demonstrates effectiveness of GHM in balancing gradient flow for classification and regression.
Abstract
Despite the great success of two-stage detectors, single-stage detector is still a more elegant and efficient way, yet suffers from the two well-known disharmonies during training, i.e. the huge difference in quantity between positive and negative examples as well as between easy and hard examples. In this work, we first point out that the essential effect of the two disharmonies can be summarized in term of the gradient. Further, we propose a novel gradient harmonizing mechanism (GHM) to be a hedging for the disharmonies. The philosophy behind GHM can be easily embedded into both classification loss function like cross-entropy (CE) and regression loss function like smooth- () loss. To this end, two novel loss functions called GHM-C and GHM-R are designed to balancing the gradient flow for anchor classification and bounding box refinement, respectively. Ablation study on MS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsGradient Harmonizing Mechanism R · Gradient Harmonizing Mechanism C · Average Pooling · ResNeXt Block · Grouped Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia?
