Safe Reinforcement Learning with Free-form Natural Language Constraints   and Pre-Trained Language Models

Xingzhou Lou; Junge Zhang; Ziyan Wang; Kaiqi Huang; Yali Du

arXiv:2401.07553·cs.LG·May 16, 2024·1 cites

Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du

PDF

Open Access

TL;DR

This paper introduces a novel safe reinforcement learning approach that leverages pre-trained language models to understand and incorporate free-form natural language constraints, eliminating the need for predefined cost functions and improving safety and flexibility.

Contribution

The paper proposes using pre-trained language models to interpret natural language constraints in safe RL, removing the reliance on ground-truth cost functions and enhancing understanding of diverse human instructions.

Findings

01

Achieves strong performance in grid-world and robot control tasks.

02

Successfully interprets complex natural language constraints.

03

Learns safe policies without ground-truth cost functions.

Abstract

Safe reinforcement learning (RL) agents accomplish given tasks while adhering to specific constraints. Employing constraints expressed via easily-understandable human language offers considerable potential for real-world applications due to its accessibility and non-reliance on domain expertise. Previous safe RL methods with natural language constraints typically adopt a recurrent neural network, which leads to limited capabilities when dealing with various forms of human language input. Furthermore, these methods often require a ground-truth cost function, necessitating domain expertise for the conversion of language constraints into a well-defined cost function that determines constraint violation. To address these issues, we proposes to use pre-trained language models (LM) to facilitate RL agents' comprehension of natural language constraints and allow them to infer costs for safe…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training