Rotated Object Detection via Scale-invariant Mahalanobis Distance in Aerial Images

Siyang Wen; Wei Guo; Yi Liu; Ruijie Wu

arXiv:2204.00840·cs.CV·October 5, 2022

TL;DR

This paper introduces a scale-invariant Mahalanobis Distance Loss (MDL) for rotated object detection in aerial images. Compared with the l_n-norm losses (L1, L2, and smooth L1) commonly used by eight-parameter methods, MDL is more consistent with the rotational IoU detection metric and more stable during training.

Contribution

The paper proposes a novel Mahalanobis Distance Loss (MDL) for eight-parameter rotated object detection, addressing the lack of scale invariance in l_n-norm losses and the boundary discontinuity problem.

Findings

01. MDL achieves state-of-the-art performance on DOTA-v1.0.

02. MDL outperforms smooth L1 loss in rotated object detection.

03. MDL provides more stable training due to scale-invariance.

Abstract

Rotated object detection in aerial images is a meaningful yet challenging task, as objects are densely arranged and have arbitrary orientations. Eight-parameter methods (which regress the coordinates of the box vertices) usually use l_n-norm losses (L1 loss, L2 loss, and smooth L1 loss) as loss functions. Because l_n-norm losses are based on the Minkowski distance, which is not scale-invariant, using them leads to inconsistency with the detection metric, rotational Intersection-over-Union (IoU), and to training instability. To address these problems, we use the Mahalanobis distance to calculate the loss between the predicted and the target box vertices' vectors, proposing a new loss function called Mahalanobis Distance Loss (MDL) for eight-parameter rotated object detection. As the Mahalanobis distance is scale-invariant, MDL is more consistent with the detection metric and more stable during training…
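The scale-invariance argument in the abstract can be illustrated with a small sketch. This is not the paper's MDL formulation: the helper names (`l1_loss`, `covariance_2d`, `mahalanobis_loss`, `scale`) are hypothetical, and estimating the covariance from the four target vertices is an assumption made only for illustration. The sketch compares an L1 loss with a Mahalanobis-style loss on the same box at two scales.

```python
import math

def l1_loss(pred, target):
    """Sum of absolute coordinate differences (a Minkowski-type distance)."""
    return sum(abs(px - tx) + abs(py - ty)
               for (px, py), (tx, ty) in zip(pred, target))

def covariance_2d(points):
    """Sample covariance (sxx, sxy, syy) of a list of (x, y) points."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    return sxx, sxy, syy

def mahalanobis_loss(pred, target):
    """Sum of per-vertex Mahalanobis distances between pred and target.

    Assumption for illustration only: the covariance S is estimated from
    the target vertices; the paper may define it differently.
    """
    sxx, sxy, syy = covariance_2d(target)
    det = sxx * syy - sxy * sxy  # non-zero for a non-degenerate box
    total = 0.0
    for (px, py), (tx, ty) in zip(pred, target):
        dx, dy = px - tx, py - ty
        # d^T S^{-1} d, with S^{-1} = [[syy, -sxy], [-sxy, sxx]] / det
        sq = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
        total += math.sqrt(sq)
    return total

def scale(points, k):
    """Scale every vertex by the factor k."""
    return [(k * x, k * y) for x, y in points]

# A rotated box as four (x, y) vertices, and a slightly perturbed prediction.
target = [(2.0, 0.0), (4.0, 2.0), (2.0, 4.0), (0.0, 2.0)]
pred = [(2.2, -0.1), (4.1, 2.3), (1.8, 4.1), (0.0, 1.8)]

for k in (1.0, 10.0):
    p, t = scale(pred, k), scale(target, k)
    print(k, l1_loss(p, t), mahalanobis_loss(p, t))
```

Under these assumptions, enlarging the box by a factor of 10 multiplies the L1 value by 10, while the Mahalanobis-style value stays the same up to floating-point error, because the vertex offsets scale by k and the covariance scales by k squared.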

Taxonomy

Methods: Mahalanobis Distance Loss (MDL)