Real-DRL: Teach and Learn in Reality
Yanbing Mao, Yihao Cai, Lui Sha

TL;DR
Real-DRL is a novel framework enabling safe, real-time learning for autonomous systems by integrating physics-based safety models with deep reinforcement learning, addressing safety and performance in physical environments.
Contribution
The paper introduces a new Real-DRL framework that combines physics-based safety models with DRL, including a dual self-learning paradigm and safety-informed batch sampling for real-world applications.
Findings
Effective safety assurance in real physical systems
Enhanced learning efficiency through safety-informed batch sampling
Successful experiments on robots demonstrating safety and performance improvements
Abstract
This paper introduces the Real-DRL framework for safety-critical autonomous systems, enabling runtime learning of a deep reinforcement learning (DRL) agent to develop safe and high-performance action policies in real plants (i.e., real physical systems to be controlled), while prioritizing safety! The Real-DRL consists of three interactive components: a DRL-Student, a PHY-Teacher, and a Trigger. The DRL-Student is a DRL agent that innovates in the dual self-learning and teaching-to-learn paradigm and the real-time safety-informed batch sampling. On the other hand, PHY-Teacher is a physics-model-based design of action policies that focuses solely on safety-critical functions. PHY-Teacher is novel in its real-time patch for two key missions: i) fostering the teaching-to-learn paradigm for DRL-Student and ii) backing up the safety of real plants. The Trigger manages the interaction between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning
