Dimension-independent rates for structured neural density estimation

Robert A. Vandermeulen; Wai Ming Tai; Bryon Aragam

arXiv:2411.15095·stat.ML·November 25, 2024

TL;DR

This paper proves that deep neural networks can learn structured densities, such as those arising in image and text data, at convergence rates that are independent of the ambient dimension and depend only on the maximum clique size of the underlying Markov graph.

Contribution

The paper establishes dimension-independent convergence rates for neural density estimation that depend on the underlying Markov structure, lending theoretical support to deep learning's effectiveness on high-dimensional data.

Findings

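01. Neural networks with a simple $L^{2}$-minimizing loss achieve a rate of $n^{-1/(4+r)}$ in nonparametric density estimation when the density is Markov to a graph whose maximum clique size is at most $r$.

02. The optimal rate in $L^{1}$ is $n^{-1/(2+r)}$, so the effective dimension is the size of the largest clique rather than the ambient dimension.

03. Both rates are independent of the data's ambient dimension.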
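To make the gap between these exponents concrete, the following is a minimal sketch that evaluates rates of the form $n^{-1/(c+k)}$ and inverts them to get the sample size needed for a target error. It assumes all constants and logarithmic factors are 1, which the summary above does not state, and it writes $d$ for the ambient dimension in the standard nonparametric rate $n^{-1/(2+d)}$. The function names `rate` and `samples_for_error` are illustrative only.

```python
def rate(n, k, offset=2):
    """Error bound n^(-1/(offset + k)), ignoring constants and log factors.

    k is the effective dimension: the clique size r for the structured
    rates, or the ambient dimension d for the standard rate.
    offset=2 gives n^(-1/(2+k)); offset=4 gives n^(-1/(4+k)).
    """
    return n ** (-1.0 / (offset + k))


def samples_for_error(eps, k, offset=2):
    """Sample size n at which rate(n, k, offset) equals eps (constants ignored)."""
    return eps ** (-(offset + k))


if __name__ == "__main__":
    eps = 0.1
    r, d = 2, 100  # hypothetical clique size and ambient dimension
    print("structured, optimal L1 rate:", samples_for_error(eps, r))
    print("structured, neural L2 loss :", samples_for_error(eps, r, offset=4))
    print("standard nonparametric     :", samples_for_error(eps, d))
```

Under these simplifying assumptions the required sample size scales as $\varepsilon^{-(2+r)}$ for the structured optimal rate and as $\varepsilon^{-(2+d)}$ for the standard rate, so the difference grows exponentially in $d - r$.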

Abstract

We show that deep neural networks achieve dimension-independent rates of convergence for learning structured densities such as those arising in image, audio, video, and text applications. More precisely, we demonstrate that neural networks with a simple $L^{2}$-minimizing loss achieve a rate of $n^{-1/(4+r)}$ in nonparametric density estimation when the underlying density is Markov to a graph whose maximum clique size is at most $r$, and we provide evidence that in the aforementioned applications, this size is typically constant, i.e., $r = O(1)$. We then establish that the optimal rate in $L^{1}$ is $n^{-1/(2+r)}$ which, compared to the standard nonparametric rate of $n^{-1/(2+d)}$, reveals that the effective dimension of such problems is the size of the largest clique in the Markov random field. These rates are independent of the data's ambient dimension, making them applicable to realistic…
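As an illustration of what an $L^{2}$-minimizing loss for density estimation can look like, here is a minimal sketch of the standard empirical integrated-squared-error criterion, $\int q^{2} - \frac{2}{n}\sum_i q(X_i)$, which equals $\|q - p\|_{2}^{2}$ up to a term that does not depend on the model $q$. The abstract does not spell out the exact loss, so treating it as this criterion is an assumption; the one-dimensional Gaussian model below is a stand-in for a neural network, chosen because $\int q^{2}$ has a closed form. All names are hypothetical.

```python
import math
import random


def gaussian_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def empirical_l2_loss(samples, mu, sigma):
    """Empirical L2 criterion: integral of q^2 minus twice the sample mean of q.

    For q = N(mu, sigma^2) the integral of q^2 is 1 / (2 * sigma * sqrt(pi)).
    The model-independent term (integral of p^2) is dropped.
    """
    integral_q_squared = 1.0 / (2.0 * sigma * math.sqrt(math.pi))
    mean_q = sum(gaussian_pdf(x, mu, sigma) for x in samples) / len(samples)
    return integral_q_squared - 2.0 * mean_q


# Toy data: samples from a standard normal (an assumption for this sketch).
rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(5000)]

loss_true = empirical_l2_loss(samples, mu=0.0, sigma=1.0)     # well-specified model
loss_shifted = empirical_l2_loss(samples, mu=2.0, sigma=1.0)  # misplaced model

if __name__ == "__main__":
    print("loss at true parameters   :", loss_true)
    print("loss at shifted parameters:", loss_shifted)
```

The criterion only needs samples and the ability to evaluate and integrate $q^{2}$, which is why it can be used as a training objective; a lower value indicates a model closer to the data-generating density in $L^{2}$.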

Videos

Dimension-Independent Rates for Structured Neural Density Estimation · SlidesLive

Taxonomy

Topics: Neural Networks and Applications · Neural dynamics and brain function