Empirical Measure Large Deviations for Reinforced Chains on Finite   Spaces

Amarjit Budhiraja; Adam Waterbury

arXiv:2205.09291·math.PR·May 20, 2022·Syst. Control. Lett.

Empirical Measure Large Deviations for Reinforced Chains on Finite Spaces

Amarjit Budhiraja, Adam Waterbury

PDF

Open Access

TL;DR

This paper establishes a large deviation principle for empirical measures of reinforced chains on finite spaces, revealing a novel rate function linked to a deterministic control problem with discounted costs.

Contribution

It introduces a new large deviation principle for reinforced chains with a distinctive rate function different from classical Markov chain results.

Findings

01

Large deviation principle for reinforced chains established.

02

Rate function characterized by a deterministic control problem.

03

Different from Donsker-Varadhan rate function.

Abstract

Let $A$ be a transition probability kernel on a finite state space $Δ^{o} = {1, \dots, d}$ such that $A (x, y) > 0$ for all $x, y \in Δ^{o}$ . Consider a reinforced chain given as a sequence ${X_{n}, n \in N_{0}}$ of $Δ^{o}$ -valued random variables, defined recursively according to, $L^{n} = \frac{1}{n} i = 0 \sum n - 1 δ_{X_{i}}, P (X_{n + 1} \in \cdot ∣ X_{0}, \dots, X_{n}) = L^{n} A (\cdot) .$ We establish a large deviation principle for ${L^{n}}$ . The rate function takes a strikingly different form than the Donsker-Varadhan rate function associated with the empirical measure of the Markov chain with transition kernel $A$ and is described in terms of a novel deterministic infinite horizon discounted cost control problem with an associated linear controlled dynamics and a nonlinear running cost involving the relative entropy function. Proofs are based on an analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Stochastic processes and financial applications · Statistical Mechanics and Entropy