RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement   Learning

Yujie Zhao; Jose Efraim Aguilar Escamill; Weyl Lu; Huazheng Wang

arXiv:2410.23569·cs.LG·January 10, 2025

RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning

Yujie Zhao, Jose Efraim Aguilar Escamill, Weyl Lu, Huazheng Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces RA-PbRL, a new risk-aware preference-based reinforcement learning algorithm that optimizes risk-sensitive objectives, with theoretical guarantees and empirical validation, addressing safety-critical applications.

Contribution

We propose RA-PbRL, the first algorithm to optimize nested and static risk-aware objectives in PbRL, with proven sublinear regret bounds and empirical performance evaluation.

Findings

01

RA-PbRL achieves sublinear regret bounds.

02

Empirical results support the effectiveness of risk-aware objectives.

03

Theoretical analysis confirms the algorithm's efficiency.

Abstract

Reinforcement Learning from Human Feedback (RLHF) has recently surged in popularity, particularly for aligning large language models and other AI systems with human intentions. At its core, RLHF can be viewed as a specialized instance of Preference-based Reinforcement Learning (PbRL), where the preferences specifically originate from human judgments rather than arbitrary evaluators. Despite this connection, most existing approaches in both RLHF and PbRL primarily focus on optimizing a mean reward objective, neglecting scenarios that necessitate risk-awareness, such as AI safety, healthcare, and autonomous driving. These scenarios often operate under a one-episode-reward setting, which makes conventional risk-sensitive objectives inapplicable. To address this, we explore and prove the applicability of two risk-aware objectives to PbRL : nested and static quantile risk objectives. We also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aguilarjose11/pbrlneurips
pytorchOfficial

Videos

RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety

MethodsFocus