Centralized Reduction of Decentralized Stochastic Control Models and their weak-Feller Regularity
Omar Mrani-Zentar, Serdar Y\"uksel

TL;DR
This paper demonstrates how certain decentralized stochastic control problems can be transformed into centralized MDPs, establishing conditions for regularity and optimal policies, thereby enabling approximation and learning approaches.
Contribution
It extends prior results by providing a unified reduction framework for decentralized control problems with general spaces and analyzing their regularity properties.
Findings
Decentralized problems can be reduced to centralized MDPs under specific information structures.
Conditions for weak-Feller regularity of the transition kernels are established.
Existence of optimal policies and their separated nature are proven.
Abstract
Decentralized stochastic control problems involving general state/measurement/action spaces are intrinsically difficult to study because of the inapplicability of standard tools from centralized (single-agent) stochastic control. In this paper, we address some of these challenges for decentralized stochastic control with standard Borel spaces under two different but tightly related information structures: the one-step delayed information sharing pattern (OSDISP), and the -step periodic information sharing pattern (KSPISP). We will show that the one-step delayed and -step periodic problems can be reduced to a centralized Markov Decision Process (MDP), generalizing prior results which considered finite, linear, or static models, by addressing several measurability and topological questions. We then provide sufficient conditions for the transition kernels of both centralized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Stability and Control of Uncertain Systems · Advanced Control Systems Optimization
