Scale-invariant scale-channel networks: Deep networks that generalise to previously unseen scales
Ylva Jansson, Tony Lindeberg

TL;DR
This paper systematically studies scale-channel networks' ability to generalize to unseen scales, introduces a new foveated architecture, and demonstrates its superior performance in handling large scale variations in visual tasks.
Contribution
The paper develops a formal analysis of scale channel networks, identifies limitations of previous designs, and proposes a novel foveated architecture that generalizes well to unseen scales.
Findings
Previous scale channel designs do not generalize well to unseen scales.
The proposed foveated scale channel networks achieve near-perfect generalization over a scale range of 8.
Foveated networks outperform prior methods, especially with limited training data.
Abstract
The ability to handle large scale variations is crucial for many real world visual tasks. A straightforward approach for handling scale in a deep network is to process an image at several scales simultaneously in a set of scale channels. Scale invariance can then, in principle, be achieved by using weight sharing between the scale channels together with max or average pooling over the outputs from the scale channels. The ability of such scale channel networks to generalise to scales not present in the training set over significant scale ranges has, however, not previously been explored. In this paper, we present a systematic study of this methodology by implementing different types of scale channel networks and evaluating their ability to generalise to previously unseen scales. We develop a formalism for analysing the covariance and invariance properties of scale channel networks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAverage Pooling
