TL;DR
LoMa introduces a data-driven approach to local feature matching, leveraging large datasets and modern training techniques, achieving significant performance improvements on challenging benchmarks.
Contribution
The paper presents LoMa, a novel local feature matching model trained on diverse data and scaled resources, with a new challenging dataset and extensive benchmarking.
Findings
LoMa outperforms state-of-the-art methods on multiple benchmarks.
The new HardMatch dataset provides a more challenging evaluation environment.
Scaling data and compute leads to substantial performance gains.
Abstract
Local feature matching has long been a fundamental component of 3D vision systems such as Structure-from-Motion (SfM), yet progress has lagged behind the rapid advances of modern data-driven approaches. The newer approaches, such as feed-forward reconstruction models, have benefited extensively from scaling dataset sizes, whereas local feature matching models are still only trained on a few mid-sized datasets. In this paper, we revisit local feature matching from a data-driven perspective. In our approach, which we call LoMa, we combine large and diverse data mixtures, modern training recipes, scaled model capacity, and scaled compute, resulting in remarkable gains in performance. Since current standard benchmarks mainly rely on collecting sparse views from successful 3D reconstructions, the evaluation of progress in feature matching has been limited to relatively easy image pairs. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
