Locality defeats the curse of dimensionality in convolutional teacher-student scenarios

Alessandro Favero; Francesco Cagnetta; Matthieu Wyart

arXiv:2106.08619 · stat.ML · December 7, 2022

TL;DR

This paper demonstrates that local receptive fields are crucial for the learning efficiency of convolutional neural networks, showing that locality, rather than translational invariance, primarily determines the learning curve exponent in teacher-student kernel regression models.

Contribution

It introduces a theoretical framework analyzing the roles of locality and invariance in CNNs, revealing locality's dominant influence on learning curves and providing empirical validation.

Findings

1. Locality determines the learning curve exponent in convolutional models.
2. Translational invariance does not significantly affect the learning rate.
3. Kernel regression with an adaptive ridge yields learning behavior similar to that of the ridgeless case.
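The learning curve exponent in these findings describes how the test error decays as a power of the training set size P, roughly error ~ P^(-β). As a minimal sketch of how such an exponent could be estimated from measurements, the snippet below fits a straight line in log-log space. The function name `fit_learning_curve_exponent` and the synthetic data (a prefactor of 2.0, a decay exponent of 0.5, small multiplicative noise) are illustrative assumptions, not values or code from the paper.

```python
import numpy as np

def fit_learning_curve_exponent(train_sizes, test_errors):
    """Fit log(error) = log(a) - beta * log(P) by least squares.

    Returns (beta, a). Hypothetical helper for illustration only.
    """
    log_p = np.log(np.asarray(train_sizes, dtype=float))
    log_err = np.log(np.asarray(test_errors, dtype=float))
    slope, intercept = np.polyfit(log_p, log_err, deg=1)
    return -slope, np.exp(intercept)

# Synthetic learning curve (assumed values, not results from the paper).
rng = np.random.default_rng(0)
sizes = np.array([128.0, 256.0, 512.0, 1024.0, 2048.0, 4096.0])
assumed_beta = 0.5
noise = np.exp(0.01 * rng.standard_normal(sizes.size))
errors = 2.0 * sizes ** (-assumed_beta) * noise

beta_hat, prefactor = fit_learning_curve_exponent(sizes, errors)
print(f"estimated beta: {beta_hat:.3f}, prefactor: {prefactor:.3f}")
```

In practice the fit would be restricted to the range of P where the curve is already in its asymptotic power-law regime, since small training sets can bias the slope.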

Abstract

Convolutional neural networks perform a local and translationally-invariant treatment of the data: quantifying which of these two aspects is central to their success remains a challenge. We study this problem within a teacher-student framework for kernel regression, using 'convolutional' kernels inspired by the neural tangent kernel of simple convolutional architectures of given filter size. Using heuristic methods from physics, we find in the ridgeless case that locality is key in determining the learning curve exponent $\beta$ (that relates the test error $\epsilon_t \sim P^{-\beta}$ to the size of the training set $P$), whereas translational invariance is not. In particular, if the filter size of the teacher $t$ is smaller than that of the student $s$, $\beta$ is a function of $s$ only and does not depend on the input dimension. We confirm our predictions on $\beta$ empirically. We…
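To make the teacher-student setup in the abstract concrete, here is a rough sketch of kernel regression with a local kernel and a translationally invariant ('convolutional') kernel built from patches of size s. Everything specific in it is an assumption made for illustration and is not taken from the text above: the Laplacian patch kernel, the cyclic patches, the averaging over patch pairs, the Gaussian inputs, the sizes `d`, `t`, `s`, and all function names. The teacher is a Gaussian random function drawn from a local kernel with filter size t, and each student interpolates the training data with a tiny ridge for numerical stability.

```python
import numpy as np

def patches(x, size):
    """Cyclic patches of each row of x; returns shape (n, d, size)."""
    n, d = x.shape
    idx = (np.arange(d)[:, None] + np.arange(size)[None, :]) % d
    return x[:, idx]

def patch_gram(x, y, size):
    """Laplacian kernel exp(-||x_i - y_j||) for all patch pairs: (n, m, d, d)."""
    px, py = patches(x, size), patches(y, size)
    diff = px[:, None, :, None, :] - py[None, :, None, :, :]
    return np.exp(-np.linalg.norm(diff, axis=-1))

def local_kernel(x, y, size):
    """Average over matching patch positions only (local, not invariant)."""
    return np.trace(patch_gram(x, y, size), axis1=2, axis2=3) / x.shape[1]

def conv_kernel(x, y, size):
    """Average over all pairs of patch positions (local and invariant)."""
    return patch_gram(x, y, size).mean(axis=(2, 3))

def sample_teacher(kernel_matrix, rng):
    """Draw a zero-mean Gaussian vector with the given covariance."""
    vals, vecs = np.linalg.eigh(kernel_matrix)
    scale = np.sqrt(np.clip(vals, 0.0, None))
    return vecs @ (scale * rng.standard_normal(vals.size))

def student_error(kernel, size, x_tr, y_tr, x_te, y_te, ridge=1e-8):
    """Near-ridgeless kernel regression; returns the mean squared test error."""
    k_tr = kernel(x_tr, x_tr, size)
    alpha = np.linalg.solve(k_tr + ridge * np.eye(len(x_tr)), y_tr)
    pred = kernel(x_te, x_tr, size) @ alpha
    return float(np.mean((pred - y_te) ** 2))

rng = np.random.default_rng(0)
d, t, s = 8, 2, 3                      # assumed input dimension and filter sizes
x = rng.standard_normal((120, d))      # assumed Gaussian inputs
y = sample_teacher(local_kernel(x, x, t), rng)
x_tr, y_tr, x_te, y_te = x[:80], y[:80], x[80:], y[80:]

err_local = student_error(local_kernel, s, x_tr, y_tr, x_te, y_te)
err_conv = student_error(conv_kernel, s, x_tr, y_tr, x_te, y_te)
print(f"local student error: {err_local:.4f}")
print(f"convolutional student error: {err_conv:.4f}")
```

Repeating the last step for increasing training set sizes P and averaging over teacher samples would give an empirical learning curve, from which an exponent can be read off on a log-log plot. A single run at this small scale is only meant to show the mechanics.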

Videos

Locality defeats the curse of dimensionality in convolutional teacher-student scenarios · SlidesLive