Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt, Sebastian Stober

TL;DR
This paper introduces Inverse Transformation Search (ITS), a novel, model-agnostic inference method that enhances neural network robustness to spatial transformations without additional training, inspired by human perceptual actions.
Contribution
The paper presents a new inference technique, ITS, that achieves zero-shot pseudo-invariance to spatial transformations, addressing robustness issues without extensive data augmentation or complex inductive biases.
Findings
ITS outperforms baseline methods on multiple benchmark datasets.
The method provides zero-shot robustness to spatial transformations.
Experimental results demonstrate improved generalization under transformed inputs.
Abstract
Deep neural networks are applied in more and more areas of everyday life. However, they still lack essential abilities, such as robustly dealing with spatially transformed input signals. Approaches to mitigate this severe robustness issue are limited to two pathways: Either models are implicitly regularised by increased sample variability (data augmentation) or explicitly constrained by hard-coded inductive biases. The limiting factor of the former is the size of the data space, which renders sufficient sample coverage intractable. The latter is limited by the engineering effort required to develop such inductive biases for every possible scenario. Instead, we take inspiration from human behaviour, where percepts are modified by mental or physical actions during inference. We propose a novel technique to emulate such an inference process for neural nets. This is achieved by traversing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Machine Learning and Data Classification · Face and Expression Recognition
