Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed
Yifan Wang, Xingyi He, Sida Peng, Dongli Tan, Xiaowei Zhou

TL;DR
This paper introduces an improved version of LoFTR that significantly enhances efficiency and accuracy in semi-dense image matching by employing aggregated attention and a two-stage correlation layer, enabling faster and more precise matching.
Contribution
The authors propose a novel aggregated attention mechanism and a two-stage correlation layer to improve LoFTR's efficiency and accuracy in semi-dense feature matching.
Findings
Our model is approximately 2.5 times faster than LoFTR.
It surpasses state-of-the-art efficient sparse matchers in accuracy.
It offers substantial efficiency benefits for large-scale and latency-sensitive applications.
Abstract
We present a novel method for efficiently producing semi-dense matches across images. Previous detector-free matcher LoFTR has shown remarkable matching capability in handling large-viewpoint change and texture-poor scenarios but suffers from low efficiency. We revisit its design choices and derive multiple improvements for both efficiency and accuracy. One key observation is that performing the transformer over the entire feature map is redundant due to shared local information, therefore we propose an aggregated attention mechanism with adaptive token selection for efficiency. Furthermore, we find spatial variance exists in LoFTR's fine correlation module, which is adverse to matching accuracy. A novel two-stage correlation layer is proposed to achieve accurate subpixel correspondences for accuracy improvement. Our efficiency optimized model is faster than LoFTR which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
