Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning
Jihui Nie, Dehui Du, Jiangnan Zhao

TL;DR
This paper introduces a novel spatio-temporal value semantics-based abstraction method for deep reinforcement learning in cyber-physical systems, improving decision-making efficiency and generalization in complex environments.
Contribution
It proposes a semantics-based abstraction to construct an abstract MDP for DRL, addressing challenges of uncertainty and data scarcity in dynamic ICPS environments.
Findings
Effective abstraction reduces semantic gaps between models.
Improved decision-making in lane-keeping and cruise control tasks.
Enhanced generalization in complex, dynamic scenarios.
Abstract
Intelligent Cyber-Physical Systems (ICPS) represent a specialized form of Cyber-Physical System (CPS) that incorporates intelligent components, notably Convolutional Neural Networks (CNNs) and Deep Reinforcement Learning (DRL), to undertake multifaceted tasks encompassing perception, decision-making, and control. The utilization of DRL for decision-making facilitates dynamic interaction with the environment, generating control actions aimed at maximizing cumulative rewards. Nevertheless, the inherent uncertainty of the operational environment and the intricate nature of ICPS necessitate exploration within complex and dynamic state spaces during the learning phase. DRL confronts challenges in terms of efficiency, generalization capabilities, and data scarcity during decision-making process. In response to these challenges, we propose an innovative abstract modeling approach grounded in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
