Rotated Object Detection via Scale-invariant Mahalanobis Distance in Aerial Images
Siyang Wen, Wei Guo, Yi Liu, Ruijie Wu

TL;DR
This paper introduces a scale-invariant Mahalanobis Distance Loss (MDL) for rotated object detection in aerial images, improving stability and performance over traditional ln-norm losses by aligning better with detection metrics.
Contribution
The paper proposes a novel Mahalanobis Distance Loss (MDL) for eight-parameter rotated object detection, addressing scale-invariance and boundary discontinuity issues.
Findings
MDL achieves state-of-the-art performance on DOTA-v1.0.
MDL outperforms smooth L1 loss in rotated object detection.
MDL provides more stable training due to scale-invariance.
Abstract
Rotated object detection in aerial images is a meaningful yet challenging task as objects are densely arranged and have arbitrary orientations. The eight-parameter (coordinates of box vectors) methods in rotated object detection usually use ln-norm losses (L1 loss, L2 loss, and smooth L1 loss) as loss functions. As ln-norm losses are mainly based on non-scale-invariant Minkowski distance, using ln-norm losses will lead to inconsistency with the detection metric rotational Intersection-over-Union (IoU) and training instability. To address the problems, we use Mahalanobis distance to calculate loss between the predicted and the target box vertices' vectors, proposing a new loss function called Mahalanobis Distance Loss (MDL) for eight-parameter rotated object detection. As Mahalanobis distance is scale-invariant, MDL is more consistent with detection metric and more stable during training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMinimum Description Length
