Recursive Construction of Confidence Regions
Tomasz R. Bielecki, Tao Chen, Igor Cialenco

TL;DR
This paper introduces a recursive method for constructing confidence regions for parameters in Markov chain models, enabling faster computation and application in robust adaptive stochastic control.
Contribution
It develops a novel recursive scheme for confidence regions and introduces a quasi-asymptotically linear recursive estimator with proven consistency and normality.
Findings
Recursive confidence regions are computationally efficient.
The new estimator is weakly consistent and asymptotically normal.
Method facilitates dynamic programming in stochastic control.
Abstract
Assuming that one-step transition kernel of a discrete time, time-homogenous Markov chain model is parameterized by a parameter , we derive a recursive (in time) construction of confidence regions for the unknown parameter of interest, say . It is supposed that the observed data used in construction of the confidence regions is generated by a Markov chain whose transition kernel corresponds to . The key step in our construction is derivation of a recursive scheme for an appropriate point estimator of . To achieve this, we start by what we call the base recursive point estimator, using which we design a quasi-asymptotically linear recursive point estimator (a concept introduced in this paper). For the latter estimator we prove its weak consistency and asymptotic normality. The recursive construction of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
