Gaussian Combined Distance: A Generic Metric for Object Detection
Ziqian Guan, Xieyi Fu, Pengjun Huang, Hengyuan Zhang, Hubin Du, Yongtao Liu, Yinglin Wang, Qang Ma

TL;DR
This paper introduces the Gaussian Combined Distance (GCD), a new scale-invariant similarity metric for object detection that improves localization accuracy and generalizes well across datasets, outperforming existing metrics like IoU and Wasserstein Distance.
Contribution
The paper proposes GCD, a novel scale-invariant metric that enables joint optimization of bounding box parameters, leading to enhanced detection performance and faster convergence.
Findings
GCD achieves state-of-the-art results on AI-TOD-v2 for tiny objects.
GCD outperforms Wasserstein Distance on MS-COCO-2017 and Visdrone-2019 datasets.
GCD improves model convergence and localization accuracy.
Abstract
In object detection, a well-defined similarity metric can significantly enhance model performance. Currently, the IoU-based similarity metric is the most commonly preferred choice for detectors. However, detectors using IoU as a similarity metric often perform poorly when detecting small objects because of their sensitivity to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein Distance as an alternative to IoU for measuring the similarity of Gaussian-distributed bounding boxes. However, we have observed that the Wasserstein Distance lacks scale invariance, which negatively impacts the model's generalization capability. Additionally, when used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Face recognition and analysis · Domain Adaptation and Few-Shot Learning
