Learning Local Features with Context Aggregation for Visual Localization

Siyu Hong; Kunhong Li; Yongcong Zhang; Zhiheng Fu; Mengyi Liu and; Yulan Guo

arXiv:2005.12880·cs.CV·June 2, 2020·1 cites

Learning Local Features with Context Aggregation for Visual Localization

Siyu Hong, Kunhong Li, Yongcong Zhang, Zhiheng Fu, Mengyi Liu and, Yulan Guo

PDF

Open Access

TL;DR

This paper introduces a novel approach for local feature learning in visual localization by fusing low-level features with multi-scale high-level semantic context, leading to improved robustness and state-of-the-art results.

Contribution

It proposes a method that combines low-level local features with high-level semantic features guided by a score map, enhancing feature discriminability.

Findings

01

Achieves state-of-the-art performance on local feature benchmark datasets.

02

Effectively fuses semantic context with local features for improved robustness.

03

Demonstrates superior results in visual localization tasks.

Abstract

Keypoint detection and description is fundamental yet important in many vision applications. Most existing methods use detect-then-describe or detect-and-describe strategy to learn local features without considering their context information. Consequently, it is challenging for these methods to learn robust local features. In this paper, we focus on the fusion of low-level textual information and high-level semantic context information to improve the discrimitiveness of local features. Specifically, we first estimate a score map to represent the distribution of potential keypoints according to the quality of descriptors of all pixels. Then, we extract and aggregate multi-scale high-level semantic features based by the guidance of the score map. Finally, the low-level local features and high-level semantic features are fused and refined using a residual module. Experiments on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization