Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun Chu, Xufeng Zhao, Cornelius Weber, Mengdi Li, Stefan Wermter

TL;DR
This paper presents Lafite-RL, a framework that leverages large language models to provide feedback for reinforcement learning in robotic manipulation, significantly improving learning efficiency and success rates.
Contribution
Introducing Lafite-RL, a novel framework that uses LLMs to guide RL agents in robotic tasks through natural language feedback, enhancing sample efficiency and performance.
Findings
Lafite-RL outperforms baseline methods in RLBench tasks.
LLM-guided RL improves learning efficiency and success rate.
Simple natural language prompts effectively guide robotic learning.
Abstract
Reinforcement Learning (RL) plays an important role in the robotic manipulation domain since it allows self-learning from trial-and-error interactions with the environment. Still, sample efficiency and reward specification seriously limit its potential. One possible solution involves learning from expert guidance. However, obtaining a human expert is impractical due to the high cost of supervising an RL agent, and developing an automatic supervisor is a challenging endeavor. Large Language Models (LLMs) demonstrate remarkable abilities to provide human-like feedback on user inputs in natural language. Nevertheless, they are not designed to directly control low-level robotic motions, as their pretraining is based on vast internet data rather than specific robotics data. In this paper, we introduce the Lafite-RL (Language agent feedback interactive Reinforcement Learning) framework, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Reinforcement Learning in Robotics
MethodsSelf-Learning
