Generalization Error Bounds on Deep Learning with Markov Datasets

Lan V. Truong

arXiv:2201.11059·stat.ML·October 13, 2022·1 cites

Generalization Error Bounds on Deep Learning with Markov Datasets

Lan V. Truong

PDF

Open Access 1 Video

TL;DR

This paper establishes upper bounds on the generalization error of deep neural networks trained on Markov datasets, extending existing bounds to non-i.i.d. data using new probabilistic inequalities.

Contribution

It introduces novel symmetrization inequalities for Markov chains and adapts generalization bounds to Markov and Bayesian data settings in deep learning.

Findings

01

Derived bounds depend on the spectral gap of the Markov chain.

02

Extended bounds to AR, ARMA, and mixture models.

03

Proposed a method to convert traditional bounds to Bayesian counterparts.

Abstract

In this paper, we derive upper bounds on generalization errors for deep neural networks with Markov datasets. These bounds are developed based on Koltchinskii and Panchenko's approach for bounding the generalization error of combined classifiers with i.i.d. datasets. The development of new symmetrization inequalities in high-dimensional probability for Markov chains is a key element in our extension, where the spectral gap of the infinitesimal generator of the Markov chain plays a key parameter in these inequalities. We also propose a simple method to convert these bounds and other similar ones in traditional deep learning and machine learning to Bayesian counterparts for both i.i.d. and Markov datasets. Extensions to $m$ -order homogeneous Markov chains such as AR and ARMA models and mixtures of several Markov data services are given.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Generalization Error Bounds on Deep Learning with Markov Datasets· slideslive

Taxonomy

TopicsBayesian Methods and Mixture Models · Bayesian Modeling and Causal Inference · Markov Chains and Monte Carlo Methods

MethodsARMA GNN