A Mathematical Theory of Deep Convolutional Neural Networks for Feature   Extraction

Thomas Wiatowski; Helmut B\"olcskei

arXiv:1512.06293·cs.IT·October 25, 2017

A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

Thomas Wiatowski, Helmut B\"olcskei

PDF

TL;DR

This paper develops a comprehensive mathematical framework for deep convolutional neural networks, analyzing their feature extraction capabilities, translation invariance, and deformation stability across various transforms, nonlinearities, and pooling methods.

Contribution

It extends Mallat's scattering network analysis to general convolutional transforms, nonlinearities, and pooling, providing new theoretical insights into deep CNNs' invariance and stability properties.

Findings

01

Proves translation invariance increases with network depth.

02

Establishes deformation sensitivity bounds for various signal classes.

03

Applies to a wide range of convolutional transforms and nonlinearities.

Abstract

Deep convolutional neural networks have led to breakthrough results in numerous practical machine learning tasks such as classification of images in the ImageNet data set, control-policy-learning to play Atari games or the board game Go, and image captioning. Many of these applications first perform feature extraction and then feed the results thereof into a trainable classifier. The mathematical analysis of deep convolutional neural networks for feature extraction was initiated by Mallat, 2012. Specifically, Mallat considered so-called scattering networks based on a wavelet transform followed by the modulus non-linearity in each network layer, and proved translation invariance (asymptotically in the wavelet scale parameter) and deformation stability of the corresponding feature extractor. This paper complements Mallat's results by developing a theory that encompasses general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.