Optimal Convergence Rates of Deep Neural Network Classifiers

Zihan Zhang; Lei Shi; Ding-Xuan Zhou

arXiv:2506.14899·stat.ML·November 24, 2025

Optimal Convergence Rates of Deep Neural Network Classifiers

Zihan Zhang, Lei Shi, Ding-Xuan Zhou

PDF

Open Access

TL;DR

This paper establishes the optimal convergence rates for deep neural network classifiers under certain compositional and noise conditions, showing that ReLU DNNs can achieve these rates independently of input dimension.

Contribution

It derives the optimal convergence rate for DNN classifiers under compositional assumptions and demonstrates that ReLU DNNs can attain this rate up to a logarithmic factor.

Findings

01

Optimal convergence rate independent of input dimension d

02

ReLU DNNs trained with hinge loss achieve the rate

03

Theoretical justification for DNN performance in high-dimensional classification

Abstract

In this paper, we study the binary classification problem on $[0, 1]^{d}$ under the Tsybakov noise condition (with exponent $s \in [0, \infty]$ ) and the compositional assumption. This assumption requires the conditional class probability function of the data distribution to be the composition of $q + 1$ vector-valued multivariate functions, where each component function is either a maximum value function or a H\"{o}lder- $β$ smooth function that depends only on $d_{*}$ of its input variables. Notably, $d_{*}$ can be significantly smaller than the input dimension $d$ . We prove that, under these conditions, the optimal convergence rate for the excess 0-1 risk of classifiers is $(\frac{1}{n})^{\frac{β \cdot ( 1 \land β ) ^{q}}{\frac{d _{*}}{s + 1} + ( 1 + \frac{1}{s + 1} ) \cdot β \cdot ( 1 \land β ) ^{q}}}$ , which is independent of the input dimension $d$ . Additionally, we demonstrate that ReLU…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications