Deep Rotation Equivariant Network

Junying Li; Zichen Yang; Haifeng Liu; Deng Cai

arXiv:1705.08623·cs.CV·March 1, 2018·6 cites

Deep Rotation Equivariant Network

Junying Li, Zichen Yang, Haifeng Liu, Deng Cai

PDF

Open Access 2 Repos

TL;DR

The paper introduces Deep Rotation Equivariant Network (DREN), which efficiently learns rotation-equivariant features by applying transformations on filters, significantly reducing computational overhead while improving performance on rotated image datasets.

Contribution

It proposes a novel architecture with cycle, isotonic, and decycle layers that apply rotation on filters instead of feature maps, enhancing efficiency and accuracy.

Findings

01

Over 2x speedup compared to previous methods

02

Reduced memory overhead in rotation-equivariant learning

03

Improved accuracy on Rotated MNIST and CIFAR-10 datasets

Abstract

Recently, learning equivariant representations has attracted considerable research attention. Dieleman et al. introduce four operations which can be inserted into convolutional neural network to learn deep representations equivariant to rotation. However, feature maps should be copied and rotated four times in each layer in their approach, which causes much running time and memory overhead. In order to address this problem, we propose Deep Rotation Equivariant Network consisting of cycle layers, isotonic layers and decycle layers. Our proposed layers apply rotation transformation on filters rather than feature maps, achieving a speed up of more than 2 times with even less memory overhead. We evaluate DRENs on Rotated MNIST and CIFAR-10 datasets and demonstrate that it can improve the performance of state-of-the-art architectures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Handwritten Text Recognition Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings