Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Nadav Cohen; Amnon Shashua

arXiv:1605.06743·cs.NE·April 19, 2017·44 cites

Inductive Bias of Deep Convolutional Networks through Pooling Geometry

Nadav Cohen, Amnon Shashua

PDF

Open Access 1 Repo

TL;DR

This paper investigates how the pooling geometry in deep convolutional networks influences their inductive bias, particularly their ability to model correlations in natural images, through theoretical analysis and empirical validation.

Contribution

It introduces the concept of separation rank to quantify correlations and shows how pooling geometry controls the network's bias towards certain input partitions, explaining the success of convolutional networks.

Findings

01

Deep networks support exponential separation ranks for certain partitions.

02

Pooling geometry determines which input correlations are favored.

03

Shallow networks are limited to linear separation ranks.

Abstract

Our formal understanding of the inductive bias that drives the success of convolutional networks on computer vision tasks is limited. In particular, it is unclear what makes hypotheses spaces born from convolution and pooling operations so suitable for natural images. In this paper we study the ability of convolutional networks to model correlations among regions of their input. We theoretically analyze convolutional arithmetic circuits, and empirically validate our findings on other types of convolutional networks as well. Correlations are formalized through the notion of separation rank, which for a given partition of the input, measures how far a function is from being separable. We show that a polynomially sized deep network supports exponentially high separation ranks for certain input partitions, while being limited to polynomial separation ranks for others. The network's pooling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HUJI-Deep/inductive-pooling
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques

MethodsConvolution