Concentration Phenomenon for Random Dynamical Systems: An Operator Theoretic Approach
Muhammad Abdullah Naeem, Miroslav Pajic

TL;DR
This paper introduces an operator theoretic framework to establish concentration inequalities for unbounded observables in Markov chains, bypassing complex probabilistic methods and applicable to reinforcement learning scenarios.
Contribution
It develops a novel operator-based approach to derive sharp concentration bounds for unbounded functions in Markov systems, emphasizing the role of hyperboundedness and transport-entropy inequalities.
Findings
Operator methods replace probabilistic techniques for concentration bounds.
Sharp non-asymptotic bounds are derived for unbounded observables.
Reversibility is shown to be non-essential for concentration phenomena.
Abstract
Via operator theoretic methods, we formalize the concentration phenomenon for a given observable `' of a discrete time Markov chain with `' as invariant ergodic measure, possibly having support on an unbounded state space. The main contribution of this paper is circumventing tedious probabilistic methods with a study of a composition of the Markov transition operator followed by a multiplication operator defined by . It turns out that even if the observable/ reward function is unbounded, but for some for some , and is hyperbounded with norm control , sharp non-asymptotic concentration bounds follow. \emph{Transport-entropy} inequality ensures the aforementioned upper bound on multiplication operator for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Diffusion and Search Dynamics · Gene Regulatory Network Analysis
