Inductive Bias of Deep Convolutional Networks through Pooling Geometry
Nadav Cohen, Amnon Shashua

TL;DR
This paper investigates how the pooling geometry in deep convolutional networks influences their inductive bias, particularly their ability to model correlations in natural images, through theoretical analysis and empirical validation.
Contribution
It introduces the concept of separation rank to quantify correlations and shows how pooling geometry controls the network's bias towards certain input partitions, explaining the success of convolutional networks.
Findings
Deep networks support exponential separation ranks for certain partitions.
Pooling geometry determines which input correlations are favored.
Shallow networks are limited to linear separation ranks.
Abstract
Our formal understanding of the inductive bias that drives the success of convolutional networks on computer vision tasks is limited. In particular, it is unclear what makes hypotheses spaces born from convolution and pooling operations so suitable for natural images. In this paper we study the ability of convolutional networks to model correlations among regions of their input. We theoretically analyze convolutional arithmetic circuits, and empirically validate our findings on other types of convolutional networks as well. Correlations are formalized through the notion of separation rank, which for a given partition of the input, measures how far a function is from being separable. We show that a polynomially sized deep network supports exponentially high separation ranks for certain input partitions, while being limited to polynomial separation ranks for others. The network's pooling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques
MethodsConvolution
