Feasibility Consistent Representation Learning for Safe Reinforcement   Learning

Zhepeng Cen; Yihang Yao; Zuxin Liu; Ding Zhao

arXiv:2405.11718·cs.LG·June 14, 2024

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Zhepeng Cen, Yihang Yao, Zuxin Liu, Ding Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces FCSRL, a novel framework that combines representation learning with safety constraints to improve safe reinforcement learning, especially in estimating safety metrics from raw states.

Contribution

The paper proposes a new framework that integrates self-supervised representation learning with safety constraints to enhance safe RL performance and safety estimation accuracy.

Findings

01

Outperforms previous baselines in safety-aware embedding learning

02

Achieves better safety constraint estimation from raw states

03

Demonstrates effectiveness on vector and image-based tasks

Abstract

In the field of safe reinforcement learning (RL), finding a balance between satisfying safety constraints and optimizing reward performance presents a significant challenge. A key obstacle in this endeavor is the estimation of safety constraints, which is typically more difficult than estimating a reward metric due to the sparse nature of the constraint signals. To address this issue, we introduce a novel framework named Feasibility Consistent Safe Reinforcement Learning (FCSRL). This framework combines representation learning with feasibility-oriented objectives to identify and extract safety-related information from the raw state for safe RL. Leveraging self-supervised learning techniques and a more learnable safety metric, our approach enhances the policy learning and constraint estimation. Empirical evaluations across a range of vector-state and image-based tasks demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

czp16/fcsrl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning