SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching

Xiangzeng Liu; Chi Wang; Guanglu Shi; Xiaodong Zhang; Qiguang Miao; Miao Fan

arXiv:2508.02278·cs.CV·August 12, 2025

SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching

Xiangzeng Liu, Chi Wang, Guanglu Shi, Xiaodong Zhang, Qiguang Miao, Miao Fan

PDF

Open Access

TL;DR

SGAD introduces a novel semantic and geometric-aware descriptor network that enhances local feature matching by generating highly discriminative descriptors, enabling direct matching without complex graph optimization, significantly improving accuracy and efficiency.

Contribution

The paper proposes SGAD, a new descriptor network that rethinks area-based matching, along with a supervision strategy and redundancy filter, to improve matching accuracy and efficiency without complex graph matching.

Findings

01

Reduces runtime by 60x compared to MESA.

02

Achieves higher accuracy in outdoor pose estimation.

03

Sets new state-of-the-art in indoor pose estimation.

Abstract

Local feature matching remains a fundamental challenge in computer vision. Recent Area to Point Matching (A2PM) methods have improved matching accuracy. However, existing research based on this framework relies on inefficient pixel-level comparisons and complex graph matching that limit scalability. In this work, we introduce the Semantic and Geometric-aware Descriptor Network (SGAD), which fundamentally rethinks area-based matching by generating highly discriminative area descriptors that enable direct matching without complex graph optimization. This approach significantly improves both accuracy and efficiency of area matching. We further improve the performance of area matching through a novel supervision strategy that decomposes the area matching task into classification and ranking subtasks. Finally, we introduce the Hierarchical Containment Redundancy Filter (HCRF) to eliminate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Graph Theory and Algorithms · Robotics and Sensor-Based Localization