TL;DR
This paper improves rotated object detection by introducing a Hausdorff distance-based matching cost and an adaptive query denoising method, significantly boosting performance on multiple datasets.
Contribution
It proposes a novel Hausdorff distance-based bipartite matching and an adaptive query denoising strategy to enhance rotated DETR detection accuracy.
Findings
Achieved +4.18 AP50 on DOTA-v2.0
Achieved +4.59 AP50 on DOTA-v1.5
Achieved +4.99 AP50 on DIOR-R
Abstract
Detection Transformers (DETR) have recently set new benchmarks in object detection. However, their performance in detecting rotated objects lags behind established oriented object detectors. Our analysis identifies a key observation: the boundary discontinuity and square-like problem in bipartite matching poses an issue with assigning appropriate ground truths to predictions, leading to duplicate low-confidence predictions. To address this, we introduce a Hausdorff distance-based cost for bipartite matching, which more accurately quantifies the discrepancy between predictions and ground truths. Additionally, we find that a static denoising approach impedes the training of rotated DETR, especially as the quality of the detector's predictions begins to exceed that of the noised ground truths. To overcome this, we propose an adaptive query denoising method that employs bipartite matching…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
