State-Wise Safe Reinforcement Learning With Pixel Observations

Simon Sinong Zhan; Yixuan Wang; Qingyuan Wu; Ruochen Jiao; Chao Huang,; Qi Zhu

arXiv:2311.02227·cs.LG·December 13, 2023·1 cites

State-Wise Safe Reinforcement Learning With Pixel Observations

Simon Sinong Zhan, Yixuan Wang, Qingyuan Wu, Ruochen Jiao, Chao Huang,, Qi Zhu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel safe reinforcement learning algorithm that uses pixel observations and latent barrier functions to effectively balance safety and reward maximization, especially in complex environments.

Contribution

It proposes a new pixel-based safe RL method with a latent barrier-like function, enabling efficient safety constraint encoding and improved safety during training.

Findings

01

Significantly reduces safety violations during training

02

Achieves faster safety convergence than existing methods

03

Maintains competitive reward performance

Abstract

In the context of safe exploration, Reinforcement Learning (RL) has long grappled with the challenges of balancing the tradeoff between maximizing rewards and minimizing safety violations, particularly in complex environments with contact-rich or non-smooth dynamics, and when dealing with high-dimensional pixel observations. Furthermore, incorporating state-wise safety constraints in the exploration and learning process, where the agent must avoid unsafe regions without prior knowledge, adds another layer of complexity. In this paper, we propose a novel pixel-observation safe RL algorithm that efficiently encodes state-wise safety constraints with unknown hazard regions through a newly introduced latent barrier-like function learning mechanism. As a joint learning framework, our approach begins by constructing a latent dynamics model with low-dimensional latent spaces derived from pixel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

simonzhan-code/step-wise_saferl_pixel
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety