G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
Pengyue Jia, Yiding Liu, Xiaopeng Li, Yuhao Wang, Yantong Du, Xiao, Han, Xuetao Wei, Shuaiqiang Wang, Dawei Yin, Xiangyu Zhao

TL;DR
G3 is a novel, adaptive framework for worldwide geolocalization that combines multi-modal representations and retrieval-augmented generation to improve location accuracy across diverse global images.
Contribution
The paper introduces G3, a new framework that integrates retrieval and generation with multi-modal learning for more accurate worldwide geolocalization.
Findings
G3 outperforms state-of-the-art methods on IM2GPS3k and YFCC4k datasets.
The framework effectively captures location-aware semantics across diverse images.
G3 demonstrates robustness to data heterogeneity and retrieval inconsistencies.
Abstract
Worldwide geolocalization aims to locate the precise location at the coordinate level of photos taken anywhere on the Earth. It is very challenging due to 1) the difficulty of capturing subtle location-aware visual semantics, and 2) the heterogeneous geographical distribution of image data. As a result, existing studies have clear limitations when scaled to a worldwide context. They may easily confuse distant images with similar visual contents, or cannot adapt to various locations worldwide with different amounts of relevant data. To resolve these limitations, we propose G3, a novel framework based on Retrieval-Augmented Generation (RAG). In particular, G3 consists of three steps, i.e., Geo-alignment, Geo-diversification, and Geo-verification to optimize both retrieval and generation phases of worldwide geolocalization. During Geo-alignment, our solution jointly learns expressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGeographic Information Systems Studies
MethodsGreedy Policy Search
