Concentration Phenomenon for Random Dynamical Systems: An Operator   Theoretic Approach

Muhammad Abdullah Naeem; Miroslav Pajic

arXiv:2212.03670·cs.LG·June 1, 2023

Concentration Phenomenon for Random Dynamical Systems: An Operator Theoretic Approach

Muhammad Abdullah Naeem, Miroslav Pajic

PDF

Open Access

TL;DR

This paper introduces an operator theoretic framework to establish concentration inequalities for unbounded observables in Markov chains, bypassing complex probabilistic methods and applicable to reinforcement learning scenarios.

Contribution

It develops a novel operator-based approach to derive sharp concentration bounds for unbounded functions in Markov systems, emphasizing the role of hyperboundedness and transport-entropy inequalities.

Findings

01

Operator methods replace probabilistic techniques for concentration bounds.

02

Sharp non-asymptotic bounds are derived for unbounded observables.

03

Reversibility is shown to be non-essential for concentration phenomena.

Abstract

Via operator theoretic methods, we formalize the concentration phenomenon for a given observable ` $r$ ' of a discrete time Markov chain with ` $μ_{π}$ ' as invariant ergodic measure, possibly having support on an unbounded state space. The main contribution of this paper is circumventing tedious probabilistic methods with a study of a composition of the Markov transition operator $P$ followed by a multiplication operator defined by $e^{r}$ . It turns out that even if the observable/ reward function is unbounded, but for some for some $q > 2$ , $∥ e^{r} ∥_{q \to 2} \propto exp (μ_{π} (r) + \frac{2 q}{q - 2})$ and $P$ is hyperbounded with norm control $∥ P ∥_{2 \to q} < e^{\frac{1}{2} [\frac{1}{2} - \frac{1}{q}]}$ , sharp non-asymptotic concentration bounds follow. \emph{Transport-entropy} inequality ensures the aforementioned upper bound on multiplication operator for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Diffusion and Search Dynamics · Gene Regulatory Network Analysis