On Separation Between Learning and Control in Partially Observed Markov   Decision Processes

Andreas A. Malikopoulos

arXiv:2211.14972·math.OC·November 29, 2022

On Separation Between Learning and Control in Partially Observed Markov Decision Processes

Andreas A. Malikopoulos

PDF

Open Access

TL;DR

This paper proposes a theoretical framework that separates learning and control tasks in partially observed Markov decision processes, enabling the combination of offline model-based control with online learning to improve CPS management.

Contribution

It introduces a novel framework that decouples learning from control in POMDPs, addressing challenges of data volume and model discrepancies in CPS.

Findings

01

Framework facilitates combining offline control with online learning.

02

Addresses safety and robustness in CPS control strategies.

03

Enhances scalability of control in large data environments.

Abstract

Cyber-physical systems (CPS) encounter a large volume of data which is added to the system gradually in real time and not altogether in advance. As the volume of data increases, the domain of the control strategies also increases, and thus it becomes challenging to search for an optimal strategy. Even if an optimal control strategy is found, implementing such strategies with increasing domains is burdensome. To derive an optimal control strategy in CPS, we typically assume an ideal model of the system. Such model-based control approaches cannot effectively facilitate optimal solutions with performance guarantees due to the discrepancy between the model and the actual CPS. Alternatively, traditional supervised learning approaches cannot always facilitate robust solutions using data derived offline. Similarly, applying reinforcement learning approaches directly to the actual CPS might…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Software Reliability and Analysis Research · Fuel Cells and Related Materials