Hoeffding's Inequality for Markov Chains under Generalized   Concentrability Condition

Hao Chen; Abhishek Gupta; Yin Sun; and Ness Shroff

arXiv:2310.02941·stat.ML·October 6, 2023·1 cites

Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Hao Chen, Abhishek Gupta, Yin Sun, and Ness Shroff

PDF

Open Access

TL;DR

This paper extends Hoeffding's inequality to a broader class of Markov chains using a generalized concentrability condition based on IPM, enabling new non-asymptotic bounds in machine learning.

Contribution

It introduces a flexible framework for Hoeffding's inequality under generalized conditions, applicable beyond traditional ergodic Markov chains.

Findings

01

Generalization bound for empirical risk minimization with Markovian data

02

Finite sample guarantee for Ployak-Ruppert averaging of SGD

03

New regret bound for rested Markovian bandits with general state space

Abstract

This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM). The generalized concentrability condition establishes a framework that interpolates and extends the existing hypotheses of Markov chain Hoeffding-type inequalities. The flexibility of our framework allows Hoeffding's inequality to be applied beyond the ergodic Markov chains in the traditional sense. We demonstrate the utility by applying our framework to several non-asymptotic analyses arising from the field of machine learning, including (i) a generalization bound for empirical risk minimization with Markovian samples, (ii) a finite sample guarantee for Ployak-Ruppert averaging of SGD, and (iii) a new regret bound for rested Markovian bandits with general state space.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Gaussian Processes and Bayesian Inference

MethodsStochastic Gradient Descent