Sequential Stochastic Control (Single or Multi-Agent) Problems Nearly Admit Change of Measures with Independent Measurements
Ian Hogeboom-Burr, Serdar Y\"uksel

TL;DR
This paper investigates whether adding small noise to measurements in stochastic control problems can make them amenable to change-of-measure techniques, showing it is possible under certain conditions.
Contribution
It demonstrates that any bounded, continuous-cost stochastic control problem with convex actions can be approximated arbitrarily closely by a system that admits change-of-measure methods.
Findings
Perturbed systems can be made static reducible with arbitrarily small error.
The approach applies to both single-agent and multi-agent decentralized control.
The solutions for perturbed systems are implementable in the original models.
Abstract
Change of measures has been an effective method in stochastic control and analysis; in continuous-time control this follows Girsanov's theorem applied to both fully observed and partially observed models, in decentralized stochastic control (or stochastic dynamic team theory) this is known as Witsenhausen's static reduction, and in discrete-time classical stochastic control Borkar has considered this method for partially observed Markov Decision processes (POMDPs) generalizing Fleming and Pardoux's approach in continuous-time. This method allows for equivalent optimal stochastic control or filtering in a new probability space where the measurements form an independent exogenous process in both discrete-time and continuous-time and the Radon-Nikodym derivative (between the true measure and the reference measure formed via the independent measurement process) is pushed to the cost or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Economic theories and models · Advanced Control Systems Optimization
