CVGL: Causal Learning and Geometric Topology
Songsong Ouyang, Yingying Zhu

TL;DR
This paper introduces CLGT, a novel framework for cross-view geo-localization that combines causal feature extraction, geometric topology fusion, and adaptive pooling to improve robustness and accuracy in complex real-world scenarios.
Contribution
The paper proposes a new CLGT framework integrating causal learning and geometric topology to address viewpoint variations and confounding factors in CVGL.
Findings
Achieves state-of-the-art performance on CVUSA and CVACT datasets.
Demonstrates robustness under real-world corruptions.
Outperforms existing methods in challenging scenarios.
Abstract
Cross-view geo-localization (CVGL) aims to estimate the geographic location of a street image by matching it with a corresponding aerial image. This is critical for autonomous navigation and mapping in complex real-world scenarios. However, the task remains challenging due to significant viewpoint differences and the influence of confounding factors. To tackle these issues, we propose the Causal Learning and Geometric Topology (CLGT) framework, which integrates two key components: a Causal Feature Extractor (CFE) that mitigates the influence of confounding factors by leveraging causal intervention to encourage the model to focus on stable, task-relevant semantics; and a Geometric Topology Fusion (GT Fusion) module that injects Bird's Eye View (BEV) road topology into street features to alleviate cross-view inconsistencies caused by extreme perspective changes. Additionally, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
