Approximation and Learning with Deep Convolutional Models: a Kernel Perspective
Alberto Bietti

TL;DR
This paper analyzes deep convolutional networks using kernel methods, revealing their functional space, inductive biases, and how pooling improves sample complexity, providing theoretical insights into their empirical success.
Contribution
It introduces hierarchical kernels inspired by convolutional networks, characterizes their RKHS, and offers generalization bounds explaining pooling's benefits.
Findings
RKHS consists of additive patch interaction models
Pooling encourages spatial similarity in the RKHS
Pooling and patches improve sample complexity guarantees
Abstract
The empirical success of deep convolutional networks on tasks involving high-dimensional data such as images or audio suggests that they can efficiently approximate certain functions that are well-suited for such tasks. In this paper, we study this through the lens of kernel methods, by considering simple hierarchical kernels with two or three convolution and pooling layers, inspired by convolutional kernel networks. These achieve good empirical performance on standard vision datasets, while providing a precise description of their functional space that yields new insights on their inductive bias. We show that the RKHS consists of additive models of interaction terms between patches, and that its norm encourages spatial similarities between these terms through pooling layers. We then provide generalization bounds which illustrate how pooling and patches yield improved sample complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsConvolution
