Universal Consistency of Deep Convolutional Neural Networks

Shao-Bo Lin; Kaidong Wang; Yao Wang; Ding-Xuan Zhou

arXiv:2106.12498·cs.LG·June 24, 2021

Universal Consistency of Deep Convolutional Neural Networks

Shao-Bo Lin, Kaidong Wang, Yao Wang, Ding-Xuan Zhou

PDF

Open Access

TL;DR

This paper proves that deep convolutional neural networks with expansive convolution are strongly universally consistent when trained with empirical risk minimization, and demonstrates their competitive performance without fully connected layers.

Contribution

It establishes the universal consistency of DCNNs with expansive convolution and compares their empirical performance to traditional hybrid models.

Findings

01

DCNNs with expansive convolution are strongly universally consistent.

02

Without fully connected layers, these DCNNs perform comparably to traditional models.

03

Empirical results support the theoretical findings.

Abstract

Compared with avid research activities of deep convolutional neural networks (DCNNs) in practice, the study of theoretical behaviors of DCNNs lags heavily behind. In particular, the universal consistency of DCNNs remains open. In this paper, we prove that implementing empirical risk minimization on DCNNs with expansive convolution (with zero-padding) is strongly universally consistent. Motivated by the universal consistency, we conduct a series of experiments to show that without any fully connected layers, DCNNs with expansive convolution perform not worse than the widely used deep neural networks with hybrid structure containing contracting (without zero-padding) convolution layers and several fully connected layers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvancements in Semiconductor Devices and Circuit Design · Ferroelectric and Negative Capacitance Devices · Adversarial Robustness in Machine Learning

MethodsConvolution