Optimal Convergence Rates of Deep Neural Network Classifiers
Zihan Zhang, Lei Shi, Ding-Xuan Zhou

TL;DR
This paper establishes the optimal convergence rates for deep neural network classifiers under certain compositional and noise conditions, showing that ReLU DNNs can achieve these rates independently of input dimension.
Contribution
It derives the optimal convergence rate for DNN classifiers under compositional assumptions and demonstrates that ReLU DNNs can attain this rate up to a logarithmic factor.
Findings
Optimal convergence rate independent of input dimension d
ReLU DNNs trained with hinge loss achieve the rate
Theoretical justification for DNN performance in high-dimensional classification
Abstract
In this paper, we study the binary classification problem on under the Tsybakov noise condition (with exponent ) and the compositional assumption. This assumption requires the conditional class probability function of the data distribution to be the composition of vector-valued multivariate functions, where each component function is either a maximum value function or a H\"{o}lder- smooth function that depends only on of its input variables. Notably, can be significantly smaller than the input dimension . We prove that, under these conditions, the optimal convergence rate for the excess 0-1 risk of classifiers is , which is independent of the input dimension . Additionally, we demonstrate that ReLU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
