Physical Deep Reinforcement Learning Towards Safety Guarantee

Hongpeng Cao; Yanbing Mao; Lui Sha; Marco Caccamo

arXiv:2303.16860·cs.LG·March 30, 2023·1 cites

Physical Deep Reinforcement Learning Towards Safety Guarantee

Hongpeng Cao, Yanbing Mao, Lui Sha, Marco Caccamo

PDF

Open Access

TL;DR

This paper introduces Phy-DRL, a novel deep reinforcement learning framework that combines physics-based control and data-driven methods to ensure safety and stability in autonomous systems, with proven guarantees and improved performance.

Contribution

The paper proposes a new architecture for DRL, integrating Lyapunov-like rewards and residual control to provide mathematical safety guarantees and enhanced robustness.

Findings

01

Guaranteed safety and stability demonstrated on inverted pendulum

02

Accelerated training and increased rewards achieved

03

Enhanced robustness in control performance

Abstract

Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces. However, the safety and stability still remain major concerns that hinder the applications of DRL to safety-critical autonomous systems. To address the concerns, we proposed the Phy-DRL: a physical deep reinforcement learning framework. The Phy-DRL is novel in two architectural designs: i) Lyapunov-like reward, and ii) residual control (i.e., integration of physics-model-based control and data-driven control). The concurrent physical reward and residual control empower the Phy-DRL the (mathematically) provable safety and stability guarantees. Through experiments on the inverted pendulum, we show that the Phy-DRL features guaranteed safety and stability and enhanced robustness, while offering remarkably accelerated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Electrostatic Discharge in Electronics · Safety Systems Engineering in Autonomy