Learning Rotation-Equivariant Features for Visual Correspondence
Jongmin Lee, Byungjin Kim, Seungwook Kim, Minsu Cho

TL;DR
This paper presents a self-supervised framework using group-equivariant CNNs to learn rotation-invariant local features for image correspondence, improving robustness to rotations without data augmentation.
Contribution
Introduces a novel self-supervised learning method with group-equivariant CNNs and a group aligning technique for rotation-invariant feature extraction.
Findings
Achieves state-of-the-art rotation-invariant matching accuracy
Demonstrates robustness to geometric and photometric variations
Effective transfer to keypoint matching and pose estimation tasks
Abstract
Extracting discriminative local features that are invariant to imaging variations is an integral part of establishing correspondences between images. In this work, we introduce a self-supervised learning framework to extract discriminative rotation-invariant descriptors using group-equivariant CNNs. Thanks to employing group-equivariant CNNs, our method effectively learns to obtain rotation-equivariant features and their orientations explicitly, without having to perform sophisticated data augmentations. The resultant features and their orientations are further processed by group aligning, a novel invariant mapping technique that shifts the group-equivariant features by their orientations along the group dimension. Our group aligning technique achieves rotation-invariance without any collapse of the group dimension and thus eschews loss of discriminability. The proposed method is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage
