From Horizontal to Rotated: Cross-View Object Geo-Localization with Orientation Awareness
Chenlin Fu, Ao Gong, Yingying Zhu

TL;DR
This paper introduces OSGeo, a novel framework for cross-view object geo-localization that uses rotated bounding boxes for better orientation fit, achieving high accuracy with significantly lower annotation costs.
Contribution
The paper proposes RBoxes for improved geometric fitting in detection-based geo-localization and introduces OSGeo with a multi-scale perception module and orientation-sensitive head.
Findings
OSGeo achieves state-of-the-art accuracy in CVOGL.
OSGeo surpasses segmentation-based methods in precision.
The new dataset CVOGL-R provides precise RBox annotations.
Abstract
Cross-View object geo-localization (CVOGL) aims to precisely determine the geographic coordinates of a query object from a ground or drone perspective by referencing a satellite map. Segmentation-based approaches offer high precision but require prohibitively expensive pixel-level annotations, whereas more economical detection-based methods suffer from lower accuracy. This performance disparity in detection is primarily caused by two factors: the poor geometric fit of Horizontal Bounding Boxes (HBoxes) for oriented objects and the degradation in precision due to feature map scaling. Motivated by these, we propose leveraging Rotated Bounding Boxes (RBoxes) as a natural extension of the detection-based paradigm. RBoxes provide a much tighter geometric fit to oriented objects. Building on this, we introduce OSGeo, a novel geo-localization framework, meticulously designed with a multi-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
