Training or Architecture? How to Incorporate Invariance in Neural   Networks

Kanchana Vaishnavi Gandikota; Jonas Geiping; Zorah L\"ahner; Adam; Czapli\'nski; Michael Moeller

arXiv:2106.10044·cs.CV·June 21, 2021

Training or Architecture? How to Incorporate Invariance in Neural Networks

Kanchana Vaishnavi Gandikota, Jonas Geiping, Zorah L\"ahner, Adam, Czapli\'nski, Michael Moeller

PDF

Open Access

TL;DR

This paper introduces a method for creating neural networks that are provably invariant to transformations like rotations and scaling by selecting a representative from each transformation orbit, enhancing robustness and efficiency.

Contribution

The authors propose a novel approach to achieve provable invariance in neural networks by choosing a representative element from the transformation orbit, applicable to continuous groups.

Findings

01

Demonstrated robustness to image rotations with discretization artifacts

02

Achieved provable rotational and scaling invariance in 3D point cloud classification

03

Improved computational efficiency over traditional invariance methods

Abstract

Many applications require the robustness, or ideally the invariance, of a neural network to certain transformations of input data. Most commonly, this requirement is addressed by either augmenting the training data, using adversarial training, or defining network architectures that include the desired invariance automatically. Unfortunately, the latter often relies on the ability to enlist all possible transformations, which make such approaches largely infeasible for infinite sets of transformations, such as arbitrary rotations or scaling. In this work, we propose a method for provably invariant network architectures with respect to group actions by choosing one element from a (possibly continuous) orbit based on a fixed criterion. In a nutshell, we intend to 'undo' any possible transformation before feeding the data into the actual network. We analyze properties of such approaches,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis