RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint   Detection and Invariant Description for Endoscopy

Mert Asim Karaoglu; Viktoria Markova; Nassir Navab; Benjamin Busam,; and Alexander Ladikos

arXiv:2309.09563·cs.CV·September 19, 2023

RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy

Mert Asim Karaoglu, Viktoria Markova, Nassir Navab, Benjamin Busam,, and Alexander Ladikos

PDF

Open Access

TL;DR

RIDE is a self-supervised learning method that achieves rotation-equivariant keypoint detection and invariant description in endoscopy, addressing the challenge of large rotational motions without manual labels.

Contribution

It introduces a novel architecture that models rotation-equivariance implicitly, trained self-supervised on endoscopic images, and sets new state-of-the-art results in keypoint matching and pose estimation.

Findings

01

Outperforms recent learning-based approaches in matching tasks.

02

Achieves state-of-the-art in relative pose estimation.

03

Demonstrates robustness to large rotations in endoscopic images.

Abstract

Unlike in natural images, in endoscopy there is no clear notion of an up-right camera orientation. Endoscopic videos therefore often contain large rotational motions, which require keypoint detection and description algorithms to be robust to these conditions. While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches learn to be robust only up to a certain degree. At the same time learning-based methods under moderate rotations often outperform classical approaches. In order to address this shortcoming, in this paper we propose RIDE, a learning-based method for rotation-equivariant detection and invariant description. Following recent advancements in group-equivariant learning, RIDE models rotation-equivariance implicitly within its architecture. Trained in a self-supervised manner on a large curation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques