Statistical Properties of Deep Neural Networks with Dependent Data

Chad Brown

arXiv:2410.11113·stat.ML·January 16, 2025

Open Access

TL;DR

This paper analyzes the statistical properties of deep neural networks when trained on dependent data, providing convergence rates and error bounds applicable to common DNN architectures in regression and classification tasks.

Contribution

It introduces general results for nonparametric sieve estimators that are directly applicable to DNNs under dependent data, including convergence rates and error bounds.

Findings

01

Established convergence rates for DNN estimators with nonstationary data.

02

Derived non-asymptotic error bounds for stationary $\beta$-mixing data.

03

Applicable to common fully connected feedforward networks with growing width and depth.

Abstract

This paper establishes statistical properties of deep neural network (DNN) estimators under dependent data. Two general results for nonparametric sieve estimators directly applicable to DNN estimators are given. The first establishes rates for convergence in probability under nonstationary data. The second provides non-asymptotic probability bounds on $L^{2}$-errors under stationary $\beta$-mixing data. I apply these results to DNN estimators in both regression and classification contexts imposing only a standard Hölder smoothness assumption. The DNN architectures considered are common in applications, featuring fully connected feedforward networks with any continuous piecewise linear activation function, unbounded weights, and a width and depth that grow with sample size. The framework provided also offers potential for research into other DNN architectures and time-series…
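
For readers unfamiliar with the dependence condition named in the abstract, one standard textbook definition of the $\beta$-mixing (absolutely regular) coefficient of a strictly stationary process $\{X_t\}$ is sketched below. The notation ($X_t$, $\beta(k)$, and the $\sigma$-fields) is assumed here for illustration and may differ from the paper's own.

$$
\beta(k) = \mathbb{E}\left[\, \sup_{B \in \sigma(X_k, X_{k+1}, \ldots)} \left| P\left(B \mid \sigma(\ldots, X_{-1}, X_0)\right) - P(B) \right| \,\right], \qquad k \ge 1.
$$

The process is called $\beta$-mixing when $\beta(k) \to 0$ as $k \to \infty$, meaning that the distant future becomes approximately independent of the past.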

Taxonomy

Topics: Neural Networks and Applications

Methods: Linear Regression