Training or Architecture? How to Incorporate Invariance in Neural Networks
Kanchana Vaishnavi Gandikota, Jonas Geiping, Zorah L\"ahner, Adam, Czapli\'nski, Michael Moeller

TL;DR
This paper introduces a method for creating neural networks that are provably invariant to transformations like rotations and scaling by selecting a representative from each transformation orbit, enhancing robustness and efficiency.
Contribution
The authors propose a novel approach to achieve provable invariance in neural networks by choosing a representative element from the transformation orbit, applicable to continuous groups.
Findings
Demonstrated robustness to image rotations with discretization artifacts
Achieved provable rotational and scaling invariance in 3D point cloud classification
Improved computational efficiency over traditional invariance methods
Abstract
Many applications require the robustness, or ideally the invariance, of a neural network to certain transformations of input data. Most commonly, this requirement is addressed by either augmenting the training data, using adversarial training, or defining network architectures that include the desired invariance automatically. Unfortunately, the latter often relies on the ability to enlist all possible transformations, which make such approaches largely infeasible for infinite sets of transformations, such as arbitrary rotations or scaling. In this work, we propose a method for provably invariant network architectures with respect to group actions by choosing one element from a (possibly continuous) orbit based on a fixed criterion. In a nutshell, we intend to 'undo' any possible transformation before feeding the data into the actual network. We analyze properties of such approaches,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
