VC dimension of partially quantized neural networks in the   overparametrized regime

Yutong Wang; Clayton D. Scott

arXiv:2110.02456·stat.ML·October 7, 2021

VC dimension of partially quantized neural networks in the overparametrized regime

Yutong Wang, Clayton D. Scott

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces hyperplane arrangement neural networks (HANNs), a class of partially quantized networks with VC dimension smaller than their number of weights, explaining their good generalization in overparametrized regimes.

Contribution

The paper demonstrates that HANNs can have significantly smaller VC dimension than their number of weights while maintaining high expressivity and competitive performance.

Findings

01

HANNs have VC dimension smaller than the number of weights.

02

Empirical risk minimization over HANNs achieves minimax classification rates.

03

HANNs match state-of-the-art performance on UCI datasets.

Abstract

Vapnik-Chervonenkis (VC) theory has so far been unable to explain the small generalization error of overparametrized neural networks. Indeed, existing applications of VC theory to large networks obtain upper bounds on VC dimension that are proportional to the number of weights, and for a large class of networks, these upper bound are known to be tight. In this work, we focus on a class of partially quantized networks that we refer to as hyperplane arrangement neural networks (HANNs). Using a sample compression analysis, we show that HANNs can have VC dimension significantly smaller than the number of weights, while being highly expressive. In particular, empirical risk minimization over HANNs in the overparametrized regime achieves the minimax rate for classification with Lipschitz posterior class probability. We further demonstrate the expressivity of HANNs empirically. On a panel of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yutongwangumich/hann
tfOfficial

Videos

VC dimension of partially quantized neural networks in the overparametrized regime· slideslive

Taxonomy

TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms