Adaptive Estimation of the Transition Density of Controlled Markov Chains

Imon Banerjee; Vinayak Rao; Harsha Honnappa

arXiv:2505.14458·math.ST·May 21, 2025

Adaptive Estimation of the Transition Density of Controlled Markov Chains

Imon Banerjee, Vinayak Rao, Harsha Honnappa

PDF

Open Access

TL;DR

This paper introduces an adaptive, assumption-free method for estimating transition densities in controlled Markov chains, applicable in various fields like reinforcement learning and time series analysis.

Contribution

It develops a novel estimator that does not require prior smoothness knowledge or control distribution assumptions, advancing non-parametric density estimation in controlled Markovian settings.

Findings

01

Provides oracle risk bounds for the estimator

02

Validates the method with both randomized and deterministic loss functions

03

Offers a flexible framework for transition density estimation

Abstract

Estimating the transition dynamics of controlled Markov chains is crucial in fields such as time series analysis, reinforcement learning, and system exploration. Traditional non-parametric density estimation methods often assume independent samples and require oracle knowledge of smoothness parameters like the H\"older continuity coefficient. These assumptions are unrealistic in controlled Markovian settings, especially when the controls are non-Markovian, since such parameters need to hold uniformly over all control values. To address this gap, we propose an adaptive estimator for the transition densities of controlled Markov chains that does not rely on prior knowledge of smoothness parameters or assumptions about the control sequence distribution. Our method builds upon recent advances in adaptive density estimation by selecting an estimator that minimizes a loss function {and}…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Gene Regulatory Network Analysis