Minimax optimality of deep neural networks on dependent data via PAC-Bayes bounds
Pierre Alquier, William Kengne

TL;DR
This paper extends the minimax optimality results of deep neural networks to dependent data modeled as Markov chains, providing theoretical guarantees for both regression and classification tasks using PAC-Bayes bounds.
Contribution
It generalizes previous minimax optimality results to dependent data scenarios and introduces PAC-Bayes bounds for analyzing deep neural network estimators in this context.
Findings
DNN estimators are minimax optimal for dependent data.
Derived upper bounds on estimation risk match known lower bounds.
Extended theoretical guarantees to Markov chain data.
Abstract
In a groundbreaking work, Schmidt-Hieber (2020) proved the minimax optimality of deep neural networks with ReLu activation for least-square regression estimation over a large class of functions defined by composition. In this paper, we extend these results in many directions. First, we remove the i.i.d. assumption on the observations, to allow some time dependence. The observations are assumed to be a Markov chain with a non-null pseudo-spectral gap. Then, we study a more general class of machine learning problems, which includes least-square and logistic regression as special cases. Leveraging on PAC-Bayes oracle inequalities and a version of Bernstein inequality due to Paulin (2015), we derive upper bounds on the estimation risk for a generalized Bayesian estimator. In the case of least-square regression, this bound matches (up to a logarithmic factor) the lower bound of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Logistic Regression
