Spatio-temporal Value Semantics-based Abstraction for Dense Deep   Reinforcement Learning

Jihui Nie; Dehui Du; Jiangnan Zhao

arXiv:2405.15829·cs.LG·May 28, 2024

Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning

Jihui Nie, Dehui Du, Jiangnan Zhao

PDF

Open Access

TL;DR

This paper introduces a novel spatio-temporal value semantics-based abstraction method for deep reinforcement learning in cyber-physical systems, improving decision-making efficiency and generalization in complex environments.

Contribution

It proposes a semantics-based abstraction to construct an abstract MDP for DRL, addressing challenges of uncertainty and data scarcity in dynamic ICPS environments.

Findings

01

Effective abstraction reduces semantic gaps between models.

02

Improved decision-making in lane-keeping and cruise control tasks.

03

Enhanced generalization in complex, dynamic scenarios.

Abstract

Intelligent Cyber-Physical Systems (ICPS) represent a specialized form of Cyber-Physical System (CPS) that incorporates intelligent components, notably Convolutional Neural Networks (CNNs) and Deep Reinforcement Learning (DRL), to undertake multifaceted tasks encompassing perception, decision-making, and control. The utilization of DRL for decision-making facilitates dynamic interaction with the environment, generating control actions aimed at maximizing cumulative rewards. Nevertheless, the inherent uncertainty of the operational environment and the intricate nature of ICPS necessitate exploration within complex and dynamic state spaces during the learning phase. DRL confronts challenges in terms of efficiency, generalization capabilities, and data scarcity during decision-making process. In response to these challenges, we propose an innovative abstract modeling approach grounded in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics