Learning Local Features with Context Aggregation for Visual Localization
Siyu Hong, Kunhong Li, Yongcong Zhang, Zhiheng Fu, Mengyi Liu and, Yulan Guo

TL;DR
This paper introduces a novel approach for local feature learning in visual localization by fusing low-level features with multi-scale high-level semantic context, leading to improved robustness and state-of-the-art results.
Contribution
It proposes a method that combines low-level local features with high-level semantic features guided by a score map, enhancing feature discriminability.
Findings
Achieves state-of-the-art performance on local feature benchmark datasets.
Effectively fuses semantic context with local features for improved robustness.
Demonstrates superior results in visual localization tasks.
Abstract
Keypoint detection and description is fundamental yet important in many vision applications. Most existing methods use detect-then-describe or detect-and-describe strategy to learn local features without considering their context information. Consequently, it is challenging for these methods to learn robust local features. In this paper, we focus on the fusion of low-level textual information and high-level semantic context information to improve the discrimitiveness of local features. Specifically, we first estimate a score map to represent the distribution of potential keypoints according to the quality of descriptors of all pixels. Then, we extract and aggregate multi-scale high-level semantic features based by the guidance of the score map. Finally, the low-level local features and high-level semantic features are fused and refined using a residual module. Experiments on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
