TL;DR
This paper investigates where to incorporate rotation invariance in feature matching pipelines, finding that early descriptor-based invariance improves speed and robustness without sacrificing upright performance.
Contribution
It demonstrates that embedding rotation invariance in descriptors is as effective as in matchers and enables faster, more robust image matching across diverse datasets.
Findings
Incorporating rotation invariance in descriptors yields similar performance to matcher-based invariance.
Early descriptor invariance allows for faster rotation-invariant matchers.
Increasing training data size improves generalization to rotated images.
Abstract
Finding matching keypoints between images is a core problem in 3D computer vision. However, modern matchers struggle with large in-plane rotations. A straightforward mitigation is to learn rotation invariance via data augmentation. However, it remains unclear at which stage rotation invariance should be incorporated. In this paper, we study this in the context of a modern sparse matching pipeline. We perform extensive experiments by training on a large collection of 3D vision datasets and evaluating on popular image matching benchmarks. Surprisingly, we find that incorporating rotation invariance already in the descriptor yields similar performance to handling it in the matcher. However, rotation invariance is achieved earlier in the matcher when it is learned in the descriptor, allowing for a faster rotation-invariant matcher. Further, we find that enforcing rotation invariance does…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
