Real-DRL: Teach and Learn in Reality

Yanbing Mao; Yihao Cai; Lui Sha

arXiv:2511.00112·cs.RO·November 4, 2025

Real-DRL: Teach and Learn in Reality

Yanbing Mao, Yihao Cai, Lui Sha

PDF

Open Access 1 Video

TL;DR

Real-DRL is a novel framework enabling safe, real-time learning for autonomous systems by integrating physics-based safety models with deep reinforcement learning, addressing safety and performance in physical environments.

Contribution

The paper introduces a new Real-DRL framework that combines physics-based safety models with DRL, including a dual self-learning paradigm and safety-informed batch sampling for real-world applications.

Findings

01

Effective safety assurance in real physical systems

02

Enhanced learning efficiency through safety-informed batch sampling

03

Successful experiments on robots demonstrating safety and performance improvements

Abstract

This paper introduces the Real-DRL framework for safety-critical autonomous systems, enabling runtime learning of a deep reinforcement learning (DRL) agent to develop safe and high-performance action policies in real plants (i.e., real physical systems to be controlled), while prioritizing safety! The Real-DRL consists of three interactive components: a DRL-Student, a PHY-Teacher, and a Trigger. The DRL-Student is a DRL agent that innovates in the dual self-learning and teaching-to-learn paradigm and the real-time safety-informed batch sampling. On the other hand, PHY-Teacher is a physics-model-based design of action policies that focuses solely on safety-critical functions. PHY-Teacher is novel in its real-time patch for two key missions: i) fostering the teaching-to-learn paradigm for DRL-Student and ii) backing up the safety of real plants. The Trigger manages the interaction between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Real-DRL: Teach and Learn in Reality· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning