Learning Stable Group Invariant Representations with Convolutional   Networks

Joan Bruna; Arthur Szlam; Yann LeCun

arXiv:1301.3537·cs.AI·January 17, 2013·ICLR·19 cites

Learning Stable Group Invariant Representations with Convolutional Networks

Joan Bruna, Arthur Szlam, Yann LeCun

PDF

Open Access

TL;DR

This paper explores how deep convolutional networks can learn stable, invariant representations by controlling the invariance group through architecture and filters, extending beyond traditional physical transformation groups.

Contribution

It demonstrates that deep CNNs can be understood as learning stable invariance groups, with architecture and filters shaping the invariance properties and enabling more abstract representations.

Findings

01

CNN architecture determines the invariance group

02

Trainable filters characterize the group action

03

Additional layers enable more abstract invariance

Abstract

Transformation groups, such as translations or rotations, effectively express part of the variability observed in many recognition problems. The group structure enables the construction of invariant signal representations with appealing mathematical properties, where convolutions, together with pooling operators, bring stability to additive and geometric perturbations of the input. Whereas physical transformation groups are ubiquitous in image and audio applications, they do not account for all the variability of complex signal classes. We show that the invariance properties built by deep convolutional networks can be cast as a form of stable group invariance. The network wiring architecture determines the invariance group, while the trainable filter coefficients characterize the group action. We give explanatory examples which illustrate how the network architecture controls the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Neural Networks and Applications