Semantic-aware Representation Learning for Homography Estimation
Yuhan Liu, Qianxin Huang, Siqi Hui, Jingwen Fu, Sanping Zhou, Kangyi, Wu, Pengna Li, Jinjun Wang

TL;DR
This paper introduces SRMatcher, a semantic-aware, detector-free feature matching method utilizing vision foundation models and a fusion block to improve homography estimation accuracy, surpassing state-of-the-art results.
Contribution
The paper presents a novel semantic-aware feature learning framework with a fusion block, enhancing detector-free matching methods for homography estimation.
Findings
SRMatcher achieves state-of-the-art performance on multiple datasets.
It increases the AUC by about 11% on HPatches compared to previous methods.
SRMatcher improves precision when integrated with other matching frameworks like LoFTR.
Abstract
Homography estimation is the task of determining the transformation from an image pair. Our approach focuses on employing detector-free feature matching methods to address this issue. Previous work has underscored the importance of incorporating semantic information, however there still lacks an efficient way to utilize semantic information. Previous methods suffer from treating the semantics as a pre-processing, causing the utilization of semantics overly coarse-grained and lack adaptability when dealing with different tasks. In our work, we seek another way to use the semantic information, that is semantic-aware feature representation learning framework.Based on this, we propose SRMatcher, a new detector-free feature matching method, which encourages the network to learn integrated semantic feature representation.Specifically, to capture precise and rich semantics, we leverage the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Medical Imaging and Analysis
