Reinforcement Learning for Control Systems with Time Delays: A Comprehensive Survey
Armando Alves Neto

TL;DR
This survey reviews reinforcement learning methods for control systems with time delays, categorizing approaches, analyzing their trade-offs, and highlighting open challenges for reliable RL deployment in delay-affected cyber-physical systems.
Contribution
It provides a comprehensive classification and analysis of RL techniques addressing time delays, offering practical guidelines and identifying future research directions.
Findings
Categorized five major families of delay-handling RL methods.
Compared advantages and limitations of each approach.
Outlined open challenges like stability certification and multi-agent co-design.
Abstract
In the last decade, Reinforcement Learning (RL) has achieved remarkable success in the control and decision-making of complex dynamical systems. However, most RL algorithms rely on the Markov Decision Process assumption, which is violated in practical cyber-physical systems affected by sensing delays, actuation latencies, and communication constraints. Such time delays introduce memory effects that can significantly degrade performance and compromise stability, particularly in networked and multi-agent environments. This paper presents a comprehensive survey of RL methods designed to address time delays in control systems. We first formalize the main classes of delays and analyze their impact on the Markov property. We then systematically categorize existing approaches into five major families: state augmentation and history-based representations, recurrent policies with learned memory,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Age of Information Optimization · Smart Grid Security and Resilience
