RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy
Mert Asim Karaoglu, Viktoria Markova, Nassir Navab, Benjamin Busam,, and Alexander Ladikos

TL;DR
RIDE is a self-supervised learning method that achieves rotation-equivariant keypoint detection and invariant description in endoscopy, addressing the challenge of large rotational motions without manual labels.
Contribution
It introduces a novel architecture that models rotation-equivariance implicitly, trained self-supervised on endoscopic images, and sets new state-of-the-art results in keypoint matching and pose estimation.
Findings
Outperforms recent learning-based approaches in matching tasks.
Achieves state-of-the-art in relative pose estimation.
Demonstrates robustness to large rotations in endoscopic images.
Abstract
Unlike in natural images, in endoscopy there is no clear notion of an up-right camera orientation. Endoscopic videos therefore often contain large rotational motions, which require keypoint detection and description algorithms to be robust to these conditions. While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches learn to be robust only up to a certain degree. At the same time learning-based methods under moderate rotations often outperform classical approaches. In order to address this shortcoming, in this paper we propose RIDE, a learning-based method for rotation-equivariant detection and invariant description. Following recent advancements in group-equivariant learning, RIDE models rotation-equivariance implicitly within its architecture. Trained in a self-supervised manner on a large curation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques
