Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery

Chandrakanth Gudavalli; Tajuddin Manhar Mohammed; Abhay Yadav; Ananth Vishnu Bhaskar; Hardik Prajapati; Cheng Peng; Rama Chellappa; Shivkumar Chandrasekaran; B. S. Manjunath

arXiv:2602.14929·cs.CV·February 17, 2026

Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery

Chandrakanth Gudavalli, Tajuddin Manhar Mohammed, Abhay Yadav, Ananth Vishnu Bhaskar, Hardik Prajapati, Cheng Peng, Rama Chellappa, Shivkumar Chandrasekaran, B. S. Manjunath

PDF

Open Access

TL;DR

Wrivinder is a geometry-driven framework that aligns ground images with satellite maps using 3D reconstruction and semantic cues, achieving sub-30 meter accuracy without supervision.

Contribution

It introduces Wrivinder, a zero-shot method for cross-view geo-localization, and MC-Sat, a new dataset for benchmarking ground-to-satellite alignment.

Findings

01

Achieves sub-30 meter geolocation accuracy in zero-shot tests.

02

Provides a new dataset, MC-Sat, for benchmarking cross-view localization.

03

Demonstrates robustness across diverse outdoor environments.

Abstract

Aligning ground-level imagery with geo-registered satellite maps is crucial for mapping, navigation, and situational awareness, yet remains challenging under large viewpoint gaps or when GPS is unreliable. We introduce Wrivinder, a zero-shot, geometry-driven framework that aggregates multiple ground photographs to reconstruct a consistent 3D scene and align it with overhead satellite imagery. Wrivinder combines SfM reconstruction, 3D Gaussian Splatting, semantic grounding, and monocular depth--based metric cues to produce a stable zenith-view rendering that can be directly matched to satellite context for metrically accurate camera geo-localization. To support systematic evaluation of this task, which lacks suitable benchmarks, we also release MC-Sat, a curated dataset linking multi-view ground imagery with geo-registered satellite tiles across diverse outdoor environments. Together,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging