Gradient Harmonized Single-stage Detector

Buyu Li; Yu Liu; Xiaogang Wang

arXiv:1811.05181·cs.CV·November 14, 2018·67 cites

Gradient Harmonized Single-stage Detector

Buyu Li, Yu Liu, Xiaogang Wang

PDF

Open Access 5 Repos

TL;DR

This paper introduces Gradient Harmonized Mechanism (GHM), a novel approach to address training disharmonies in single-stage detectors by balancing gradient flows, leading to improved detection performance without extensive hyper-parameter tuning.

Contribution

The paper proposes GHM, a new gradient-based loss balancing method, with two specific loss functions GHM-C and GHM-R, enhancing single-stage detector accuracy.

Findings

01

Achieves 41.6 mAP on COCO test-dev set, surpassing previous methods.

02

Improves detection performance without extensive hyper-parameter tuning.

03

Demonstrates effectiveness of GHM in balancing gradient flow for classification and regression.

Abstract

Despite the great success of two-stage detectors, single-stage detector is still a more elegant and efficient way, yet suffers from the two well-known disharmonies during training, i.e. the huge difference in quantity between positive and negative examples as well as between easy and hard examples. In this work, we first point out that the essential effect of the two disharmonies can be summarized in term of the gradient. Further, we propose a novel gradient harmonizing mechanism (GHM) to be a hedging for the disharmonies. The philosophy behind GHM can be easily embedded into both classification loss function like cross-entropy (CE) and regression loss function like smooth- $L_{1}$ ( $S L_{1}$ ) loss. To this end, two novel loss functions called GHM-C and GHM-R are designed to balancing the gradient flow for anchor classification and bounding box refinement, respectively. Ablation study on MS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications

MethodsGradient Harmonizing Mechanism R · Gradient Harmonizing Mechanism C · Average Pooling · ResNeXt Block · Grouped Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia?