Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon   Robotics Tasks

Murtaza Dalal; Tarun Chiruvolu; Devendra Chaplot; Ruslan Salakhutdinov

arXiv:2405.01534·cs.LG·May 3, 2024·5 cites

Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks

Murtaza Dalal, Tarun Chiruvolu, Devendra Chaplot, Ruslan Salakhutdinov

PDF

Open Access

TL;DR

Plan-Seq-Learn (PSL) leverages language models and motion planning to guide reinforcement learning, enabling robots to solve complex long-horizon tasks from raw visuals without pre-defined skills, achieving state-of-the-art results.

Contribution

The paper introduces PSL, a modular framework combining language-guided planning with RL to address long-horizon robotics tasks without pre-defined skill libraries.

Findings

01

Achieves over 85% success rate on 25+ challenging tasks.

02

Outperforms existing language-based and classical methods.

03

Effective on raw visual inputs across four benchmarks.

Abstract

Large Language Models (LLMs) have been shown to be capable of performing high-level planning for long-horizon robotics tasks, yet existing methods require access to a pre-defined skill library (e.g. picking, placing, pulling, pushing, navigating). However, LLM planning does not address how to design or learn those behaviors, which remains challenging particularly in long-horizon settings. Furthermore, for many tasks of interest, the robot needs to be able to adjust its behavior in a fine-grained manner, requiring the agent to be capable of modifying low-level control actions. Can we instead use the internet-scale knowledge from LLMs for high-level policies, guiding reinforcement learning (RL) policies to efficiently solve robotic control tasks online without requiring a pre-determined set of skills? In this paper, we propose Plan-Seq-Learn (PSL): a modular approach that uses motion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Automated Systems

MethodsSparse Evolutionary Training · Lib