Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du

TL;DR
This paper introduces a novel safe reinforcement learning approach that leverages pre-trained language models to understand and incorporate free-form natural language constraints, eliminating the need for predefined cost functions and improving safety and flexibility.
Contribution
The paper proposes using pre-trained language models to interpret natural language constraints in safe RL, removing the reliance on ground-truth cost functions and enhancing understanding of diverse human instructions.
Findings
Achieves strong performance in grid-world and robot control tasks.
Successfully interprets complex natural language constraints.
Learns safe policies without ground-truth cost functions.
Abstract
Safe reinforcement learning (RL) agents accomplish given tasks while adhering to specific constraints. Employing constraints expressed via easily-understandable human language offers considerable potential for real-world applications due to its accessibility and non-reliance on domain expertise. Previous safe RL methods with natural language constraints typically adopt a recurrent neural network, which leads to limited capabilities when dealing with various forms of human language input. Furthermore, these methods often require a ground-truth cost function, necessitating domain expertise for the conversion of language constraints into a well-defined cost function that determines constraint violation. To address these issues, we proposes to use pre-trained language models (LM) to facilitate RL agents' comprehension of natural language constraints and allow them to infer costs for safe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsSparse Evolutionary Training
