The Role of Time Delay in Sim2real Transfer of Reinforcement Learning   for Cyber-Physical Systems

Mohamad Chehadeh; Igor Boiko; Yahya Zweiri

arXiv:2209.15216·eess.SY·October 3, 2022·1 cites

The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Cyber-Physical Systems

Mohamad Chehadeh, Igor Boiko, Yahya Zweiri

PDF

Open Access

TL;DR

This paper investigates how fractional time delays affect the transfer of reinforcement learning policies from simulation to real cyber-physical systems, proposing a sampling scheme to improve real-world performance.

Contribution

It introduces a novel analysis of fractional delays in RL for cyber-physical systems and proposes a sampling scheme to enhance sim2real transfer.

Findings

01

Sampling scheme improves RL training efficiency

02

Agents perform well in UAV simulations

03

Delay consideration reduces oscillations

Abstract

This paper analyzes the simulation to reality gap in reinforcement learning (RL) cyber-physical systems with fractional delays (i.e. delays that are non-integer multiple of the sampling period). The consideration of fractional delay has important implications on the nature of the cyber-physical system considered. Systems with delays are non-Markovian, and the system state vector needs to be extended to make the system Markovian. We show that this is not possible when the delay is in the output, and the problem would always be non-Markovian. Based on this analysis, a sampling scheme is proposed that results in efficient RL training and agents that perform well in realistic multirotor unmanned aerial vehicle simulations. We demonstrate that the resultant agents do not produce excessive oscillations, which is not the case with RL agents that do not consider time delay in the model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience · Reinforcement Learning in Robotics