Adaptive Estimation of the Transition Density of Controlled Markov Chains
Imon Banerjee, Vinayak Rao, Harsha Honnappa

TL;DR
This paper introduces an adaptive, assumption-free method for estimating transition densities in controlled Markov chains, applicable in various fields like reinforcement learning and time series analysis.
Contribution
It develops a novel estimator that does not require prior smoothness knowledge or control distribution assumptions, advancing non-parametric density estimation in controlled Markovian settings.
Findings
Provides oracle risk bounds for the estimator
Validates the method with both randomized and deterministic loss functions
Offers a flexible framework for transition density estimation
Abstract
Estimating the transition dynamics of controlled Markov chains is crucial in fields such as time series analysis, reinforcement learning, and system exploration. Traditional non-parametric density estimation methods often assume independent samples and require oracle knowledge of smoothness parameters like the H\"older continuity coefficient. These assumptions are unrealistic in controlled Markovian settings, especially when the controls are non-Markovian, since such parameters need to hold uniformly over all control values. To address this gap, we propose an adaptive estimator for the transition densities of controlled Markov chains that does not rely on prior knowledge of smoothness parameters or assumptions about the control sequence distribution. Our method builds upon recent advances in adaptive density estimation by selecting an estimator that minimizes a loss function {and}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Gene Regulatory Network Analysis
