Accelerating Reinforcement Learning of Robotic Manipulations via   Feedback from Large Language Models

Kun Chu; Xufeng Zhao; Cornelius Weber; Mengdi Li; Stefan Wermter

arXiv:2311.02379·cs.RO·November 7, 2023·1 cites

Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

Kun Chu, Xufeng Zhao, Cornelius Weber, Mengdi Li, Stefan Wermter

PDF

Open Access

TL;DR

This paper presents Lafite-RL, a framework that leverages large language models to provide feedback for reinforcement learning in robotic manipulation, significantly improving learning efficiency and success rates.

Contribution

Introducing Lafite-RL, a novel framework that uses LLMs to guide RL agents in robotic tasks through natural language feedback, enhancing sample efficiency and performance.

Findings

01

Lafite-RL outperforms baseline methods in RLBench tasks.

02

LLM-guided RL improves learning efficiency and success rate.

03

Simple natural language prompts effectively guide robotic learning.

Abstract

Reinforcement Learning (RL) plays an important role in the robotic manipulation domain since it allows self-learning from trial-and-error interactions with the environment. Still, sample efficiency and reward specification seriously limit its potential. One possible solution involves learning from expert guidance. However, obtaining a human expert is impractical due to the high cost of supervising an RL agent, and developing an automatic supervisor is a challenging endeavor. Large Language Models (LLMs) demonstrate remarkable abilities to provide human-like feedback on user inputs in natural language. Nevertheless, they are not designed to directly control low-level robotic motions, as their pretraining is based on vast internet data rather than specific robotics data. In this paper, we introduce the Lafite-RL (Language agent feedback interactive Reinforcement Learning) framework, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Reinforcement Learning in Robotics

MethodsSelf-Learning