Grounding Language Models in Autonomous Loco-manipulation Tasks

Jin Wang; Nikos Tsagarakis

arXiv:2409.01326·cs.RO·September 4, 2024

Grounding Language Models in Autonomous Loco-manipulation Tasks

Jin Wang, Nikos Tsagarakis

PDF

Open Access

TL;DR

This paper introduces a framework combining reinforcement learning, whole-body optimization, and large language models to enable humanoid robots to perform complex, open-ended loco-manipulation tasks with high autonomy in unstructured environments.

Contribution

It presents a novel hierarchical planning approach that integrates language models with motion primitives for versatile humanoid robot control.

Findings

01

Effective adaptation to new loco-manipulation tasks in simulation and real-world

02

High autonomy demonstrated from free-text commands in unstructured scenes

03

Successful integration of LLMs with motion planning for complex tasks

Abstract

Humanoid robots with behavioral autonomy have consistently been regarded as ideal collaborators in our daily lives and promising representations of embodied intelligence. Compared to fixed-based robotic arms, humanoid robots offer a larger operational space while significantly increasing the difficulty of control and planning. Despite the rapid progress towards general-purpose humanoid robots, most studies remain focused on locomotion ability with few investigations into whole-body coordination and tasks planning, thus limiting the potential to demonstrate long-horizon tasks involving both mobility and manipulation under open-ended verbal instructions. In this work, we propose a novel framework that learns, selects, and plans behaviors based on tasks in different scenarios. We combine reinforcement learning (RL) with whole-body optimization to generate robot motions and store them into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Robot Manipulation and Learning