RF-Net: An End-to-End Image Matching Network based on Receptive Field
Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen,, Ming Cheng, Zijian He

TL;DR
RF-Net is an end-to-end trainable image matching network that leverages receptive fields and a novel neighbor mask loss to improve keypoint detection and descriptor stability, outperforming previous state-of-the-art methods.
Contribution
The paper introduces RF-Net, a new receptive field-based architecture with a neighbor mask loss, enhancing keypoint detection and training stability in image matching.
Findings
RF-Net outperforms existing methods on multiple benchmarks.
Receptive feature maps improve keypoint detection.
Neighbor mask loss stabilizes descriptor training.
Abstract
This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images. Building end-to-end trainable matching framework is desirable and challenging. The very recent approach, LF-Net, successfully embeds the entire feature extraction pipeline into a jointly trainable pipeline, and produces the state-of-the-art matching results. This paper introduces two modifications to the structure of LF-Net. First, we propose to construct receptive feature maps, which lead to more effective keypoint detection. Second, we introduce a general loss function term, neighbor mask, to facilitate training patch selection. This results in improved stability in descriptor training. We trained RF-Net on the open dataset HPatches, and compared it with other methods on multiple benchmark datasets. Experiments show that RF-Net outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Multimodal Machine Learning Applications
