RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching
Udit Singh Parihar, Aniket Gujarathi, Kinal Mehta, Satyajit Tourani,, Sourav Garg, Michael Milford, K. Madhava Krishna

TL;DR
This paper introduces a novel rotation-robust local descriptor framework that combines data augmentation and orthographic projection, significantly improving feature matching and place recognition under extreme viewpoint variations.
Contribution
It presents a new method for learning rotation-invariant descriptors using data augmentation and orthographic views, enhancing robustness in challenging viewpoint scenarios.
Findings
Outperforms baseline and state-of-the-art methods in key tasks
Enables higher place recognition accuracy across opposing viewpoints
Achieves useful performance under extreme viewpoint changes
Abstract
The use of local detectors and descriptors in typical computer vision pipelines work well until variations in viewpoint and appearance change become extreme. Past research in this area has typically focused on one of two approaches to this challenge: the use of projections into spaces more suitable for feature matching under extreme viewpoint changes, and attempting to learn features that are inherently more robust to viewpoint change. In this paper, we present a novel framework that combines learning of invariant descriptors through data augmentation and orthographic viewpoint projection. We propose rotation-robust local descriptors, learnt through training data augmentation based on rotation homographies, and a correspondence ensemble technique that combines vanilla feature correspondences with those obtained through rotation-robust features. Using a range of benchmark datasets as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
