Empirical Measure Large Deviations for Reinforced Chains on Finite Spaces
Amarjit Budhiraja, Adam Waterbury

TL;DR
This paper establishes a large deviation principle for empirical measures of reinforced chains on finite spaces, revealing a novel rate function linked to a deterministic control problem with discounted costs.
Contribution
It introduces a new large deviation principle for reinforced chains with a distinctive rate function different from classical Markov chain results.
Findings
Large deviation principle for reinforced chains established.
Rate function characterized by a deterministic control problem.
Different from Donsker-Varadhan rate function.
Abstract
Let be a transition probability kernel on a finite state space such that for all . Consider a reinforced chain given as a sequence of -valued random variables, defined recursively according to, We establish a large deviation principle for . The rate function takes a strikingly different form than the Donsker-Varadhan rate function associated with the empirical measure of the Markov chain with transition kernel and is described in terms of a novel deterministic infinite horizon discounted cost control problem with an associated linear controlled dynamics and a nonlinear running cost involving the relative entropy function. Proofs are based on an analysis of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic processes and financial applications · Statistical Mechanics and Entropy
