Hierarchical Attention Fusion for Geo-Localization

Liqi Yan; Yiming Cui; Yingjie Chen; Dongfang Liu

arXiv:2102.09186·cs.CV·February 19, 2021·1 cites

Hierarchical Attention Fusion for Geo-Localization

Liqi Yan, Yiming Cui, Yingjie Chen, Dongfang Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical attention fusion network that leverages multi-scale features from CNNs to improve robustness in geo-localization tasks, especially under drastic scale variations.

Contribution

It proposes a novel hierarchical attention fusion approach with self-supervised training for enhanced multi-scale feature integration in geo-localization.

Findings

01

Outperforms existing state-of-the-art methods on large-scale benchmarks.

02

Effective handling of drastic scale variations in scene localization.

03

Self-supervised adaptive weighting improves feature emphasis.

Abstract

Geo-localization is a critical task in computer vision. In this work, we cast the geo-localization as a 2D image retrieval task. Current state-of-the-art methods for 2D geo-localization are not robust to locate a scene with drastic scale variations because they only exploit features from one semantic level for image representations. To address this limitation, we introduce a hierarchical attention fusion network using multi-scale features for geo-localization. We extract the hierarchical feature maps from a convolutional neural network (CNN) and organically fuse the extracted features for image representations. Our training is self-supervised using adaptive weights to control the attention of feature emphasis from each hierarchical level. Evaluation results on the image retrieval and the large-scale geo-localization benchmarks indicate that our method outperforms the existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YanLiqi/HAF
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization