Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization
Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin, Tang, Tao He, Junchi Yan

TL;DR
This paper introduces a novel Gaussian distribution-based approach for rotated object detection, replacing traditional bounding box methods, leading to improved high-precision detection and extending to 3-D with broad applicability.
Contribution
The paper proposes modeling rotated objects as Gaussian distributions and a new regression loss based on Kullback-Leibler Divergence, addressing fundamental limitations of traditional bounding box methods.
Findings
Outperforms existing methods on twelve datasets
Effectively handles boundary discontinuity and square-like issues
Extends to 3-D object detection with tailored algorithms
Abstract
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects. We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection, especially for high-precision detection with high IoU (e.g. 0.75). Instead, we propose to model the rotated objects as Gaussian distributions. A direct advantage is that our new regression loss regarding the distance between two Gaussians e.g. Kullback-Leibler Divergence (KLD), can well align the actual detection performance metric, which is not well addressed in existing methods. Moreover, the two bottlenecks i.e. boundary discontinuity and square-like problem also disappear. We also propose an efficient Gaussian metric-based label assignment strategy to further boost the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Automated Road and Building Extraction · Robotics and Sensor-Based Localization
MethodsALIGN · Balanced Selection
