Semantic-aware Representation Learning for Homography Estimation

Yuhan Liu; Qianxin Huang; Siqi Hui; Jingwen Fu; Sanping Zhou; Kangyi Wu; Pengna Li; Jinjun Wang

arXiv:2407.13284·cs.IR·October 15, 2024

TL;DR

This paper introduces SRMatcher, a semantic-aware, detector-free feature matching method utilizing vision foundation models and a fusion block to improve homography estimation accuracy, surpassing state-of-the-art results.

Contribution

The paper presents a novel semantic-aware feature learning framework with a fusion block, enhancing detector-free matching methods for homography estimation.

Findings

01 SRMatcher achieves state-of-the-art performance on multiple datasets.

02 It increases the AUC by about 11% on HPatches compared to previous methods.

03 SRMatcher improves precision when integrated with other matching frameworks such as LoFTR.

Abstract

Homography estimation is the task of determining the transformation between the two images of an image pair. Our approach addresses it with detector-free feature matching. Previous work has underscored the importance of incorporating semantic information, but an efficient way to use it is still lacking. Existing methods treat semantics as a pre-processing step, which makes their use of semantics overly coarse-grained and hard to adapt across tasks. In our work, we take another route: a semantic-aware feature representation learning framework. Based on this, we propose SRMatcher, a new detector-free feature matching method that encourages the network to learn an integrated semantic feature representation. Specifically, to capture precise and rich semantics, we leverage the…
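For readers new to the task, the following is a minimal sketch of how a homography can be recovered once point matches between two images are available. It is not SRMatcher's method: it is the classic four-point linear solve, written in plain Python, and the function names (`solve_linear`, `homography_from_4_points`, `apply_homography`) are illustrative assumptions rather than anything from the paper or its repository. It also assumes the bottom-right entry of the matrix can be normalized to 1.

```python
def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Build an augmented matrix so the caller's inputs are left untouched.
    m = [[float(v) for v in row] + [float(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography_from_4_points(src, dst):
    """Return the 3x3 homography H (H[2][2] = 1) that maps src onto dst.

    src and dst are lists of four (x, y) pairs in corresponding order.
    """
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        # Each match gives two linear equations in the eight unknowns.
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b) + [1.0]
    return [h[0:3], h[3:6], h[6:9]]


def apply_homography(h, point):
    """Map one (x, y) point through H, including the perspective divide."""
    x, y = point
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


if __name__ == "__main__":
    src = [(0, 0), (1, 0), (1, 1), (0, 1)]
    dst = [(0, 0), (2, 0), (3, 3), (0, 1)]
    H = homography_from_4_points(src, dst)
    print([apply_homography(H, p) for p in src])
```

A feature matcher typically returns many more than four correspondences, some of them wrong, so in practice the linear solve is usually wrapped in a robust estimator such as RANSAC. The quality of the matches fed into that step is what a matching method aims to improve.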

Code & Models

Repositories

lyh200095/srmatcher (Official)

Taxonomy

Topics: Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Medical Imaging and Analysis