Safety through feedback in Constrained RL

Shashank Reddy Chirra; Pradeep Varakantham; Praveen Paruchuri

arXiv:2406.19626·cs.AI·January 14, 2025

Safety through feedback in Constrained RL

Shashank Reddy Chirra, Pradeep Varakantham, Praveen Paruchuri

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a scalable feedback-based approach for safe reinforcement learning that leverages trajectory-level feedback, novelty sampling, and surrogate objectives to efficiently learn cost functions in complex environments.

Contribution

It introduces a novel method that extends feedback-based safety learning to complex domains using trajectory-level feedback and novelty sampling, reducing evaluator burden.

Findings

01

Effective in Safety Gymnasium environments

02

Reduces feedback collection costs

03

Scales to complex, real-world scenarios

Abstract

In safety-critical RL settings, the inclusion of an additional cost function is often favoured over the arduous task of modifying the reward function to ensure the agent's safe behaviour. However, designing or evaluating such a cost function can be prohibitively expensive. For instance, in the domain of self-driving, designing a cost function that encompasses all unsafe behaviours (e.g. aggressive lane changes) is inherently complex. In such scenarios, the cost function can be learned from feedback collected offline in between training rounds. This feedback can be system generated or elicited from a human observing the training process. Previous approaches have not been able to scale to complex environments and are constrained to receiving feedback at the state level which can be expensive to collect. To this end, we introduce an approach that scales to more complex domains and extends…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shshnkreddy/RLSF
pytorchOfficial

Videos

Safety through feedback in Constrained RL· slideslive

Taxonomy

TopicsFormal Methods in Verification · Software Reliability and Analysis Research · Software Testing and Debugging Techniques