Deep Networks Provably Classify Data on Curves
Tingran Wang, Sam Buchanan, Dar Gilboa, John Wright

TL;DR
This paper proves that deep neural networks can reliably classify data lying on smooth curves on the sphere, with guarantees depending on the network depth and intrinsic data properties, using NTK analysis.
Contribution
It provides the first generalization guarantee for deep networks classifying nonlinear data based solely on intrinsic geometric properties.
Findings
Deep networks with sufficient depth can classify data on curves with high probability.
The neural tangent kernel (NTK) can be approximated by a translationally invariant operator for deep networks.
Deeper networks improve the stability and invertibility of the NTK, aiding classification.
Abstract
Data with low-dimensional nonlinear structure are ubiquitous in engineering and scientific problems. We study a model problem with such structure -- a binary classification task that uses a deep fully-connected neural network to classify data drawn from two disjoint smooth curves on the unit sphere. Aside from mild regularity conditions, we place no restrictions on the configuration of the curves. We prove that when (i) the network depth is large relative to certain geometric properties that set the difficulty of the problem and (ii) the network width and number of samples is polynomial in the depth, randomly-initialized gradient descent quickly learns to correctly classify all points on the two curves with high probability. To our knowledge, this is the first generalization guarantee for deep networks with nonlinear data that depends only on intrinsic data properties. Our analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Advanced Neuroimaging Techniques and Applications
MethodsNeural Tangent Kernel
