The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Cyber-Physical Systems
Mohamad Chehadeh, Igor Boiko, Yahya Zweiri

TL;DR
This paper investigates how fractional time delays affect the transfer of reinforcement learning policies from simulation to real cyber-physical systems, proposing a sampling scheme to improve real-world performance.
Contribution
It introduces a novel analysis of fractional delays in RL for cyber-physical systems and proposes a sampling scheme to enhance sim2real transfer.
Findings
Sampling scheme improves RL training efficiency
Agents perform well in UAV simulations
Delay consideration reduces oscillations
Abstract
This paper analyzes the simulation to reality gap in reinforcement learning (RL) cyber-physical systems with fractional delays (i.e. delays that are non-integer multiple of the sampling period). The consideration of fractional delay has important implications on the nature of the cyber-physical system considered. Systems with delays are non-Markovian, and the system state vector needs to be extended to make the system Markovian. We show that this is not possible when the delay is in the output, and the problem would always be non-Markovian. Based on this analysis, a sampling scheme is proposed that results in efficient RL training and agents that perform well in realistic multirotor unmanned aerial vehicle simulations. We demonstrate that the resultant agents do not produce excessive oscillations, which is not the case with RL agents that do not consider time delay in the model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Security and Resilience · Reinforcement Learning in Robotics
