TL;DR
This paper introduces $SE(2)$ group convolutional layers into CNNs to achieve rotation and translation covariance, improving medical image analysis performance without data augmentation.
Contribution
It develops a novel $SE(2)$ group convolution framework for CNNs, enabling rotation and translation covariant processing in medical imaging tasks.
Findings
Achieves state-of-the-art results in histopathology, retinal imaging, and electron microscopy.
Outperforms standard CNNs that rely on data augmentation.
Provides a unified geometric approach to handle rotations in deep learning.
Abstract
We propose a framework for rotation and translation covariant deep learning using group convolutions. The group product of the special Euclidean motion group describes how a concatenation of two roto-translations results in a net roto-translation. We encode this geometric structure into convolutional neural networks (CNNs) via group convolutional layers, which fit into the standard 2D CNN framework, and which allow to generically deal with rotated input samples without the need for data augmentation. We introduce three layers: a lifting layer which lifts a 2D (vector valued) image to an -image, i.e., 3D (vector valued) data whose domain is ; a group convolution layer from and to an -image; and a projection layer from an -image to a 2D image. The lifting and group convolution layers are covariant (the output roto-translates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
