ResMatch: Residual Attention Learning for Local Feature Matching
Yuxin Deng, Jiayi Ma

TL;DR
ResMatch introduces residual attention mechanisms for local feature matching, enhancing accuracy and efficiency by integrating descriptor similarity and spatial relations, validated through extensive experiments in matching, pose estimation, and localization.
Contribution
The paper proposes a novel residual attention learning framework for feature matching that incorporates descriptor and spatial information, improving performance and computational efficiency.
Findings
ResMatch outperforms existing methods in feature matching accuracy.
Sparse attention within neighborhoods enhances computational efficiency.
The approach improves pose estimation and visual localization results.
Abstract
Attention-based graph neural networks have made great progress in feature matching learning. However, insight of how attention mechanism works for feature matching is lacked in the literature. In this paper, we rethink cross- and self-attention from the viewpoint of traditional feature matching and filtering. In order to facilitate the learning of matching and filtering, we inject the similarity of descriptors and relative positions into cross- and self-attention score, respectively. In this way, the attention can focus on learning residual matching and filtering functions with reference to the basic functions of measuring visual and spatial correlation. Moreover, we mine intra- and inter-neighbors according to the similarity of descriptors and relative positions. Then sparse attention for each point can be performed only within its neighborhoods to acquire higher computation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsFocus
