Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching
Yujie Fu, Yihong Wu

TL;DR
This paper introduces Scale-Net, a neural network that accurately estimates scale ratios between images to improve large-scale invariant image matching, significantly enhancing existing feature matching methods.
Contribution
The paper proposes a novel neural network, Scale-Net, with a covisibility-attention module to better estimate scale differences, improving large-scale image matching accuracy.
Findings
Scale-Net achieves higher scale ratio estimation accuracy.
The method improves the performance of local feature matching.
Enhanced generalization ability over existing methods.
Abstract
Most image matching methods perform poorly when encountering large scale changes in images. To solve this problem, firstly, we propose a scale-difference-aware image matching method (SDAIM) that reduces image scale differences before local feature extraction, via resizing both images of an image pair according to an estimated scale ratio. Secondly, in order to accurately estimate the scale ratio, we propose a covisibility-attention-reinforced matching module (CVARM) and then design a novel neural network, termed as Scale-Net, based on CVARM. The proposed CVARM can lay more stress on covisible areas within the image pair and suppress the distraction from those areas visible in only one image. Quantitative and qualitative experiments confirm that the proposed Scale-Net has higher scale ratio estimation accuracy and much better generalization ability compared with all the existing scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Human Pose and Action Recognition
