Grounding Language Models in Autonomous Loco-manipulation Tasks
Jin Wang, Nikos Tsagarakis

TL;DR
This paper introduces a framework combining reinforcement learning, whole-body optimization, and large language models to enable humanoid robots to perform complex, open-ended loco-manipulation tasks with high autonomy in unstructured environments.
Contribution
It presents a novel hierarchical planning approach that integrates language models with motion primitives for versatile humanoid robot control.
Findings
Effective adaptation to new loco-manipulation tasks in simulation and real-world
High autonomy demonstrated from free-text commands in unstructured scenes
Successful integration of LLMs with motion planning for complex tasks
Abstract
Humanoid robots with behavioral autonomy have consistently been regarded as ideal collaborators in our daily lives and promising representations of embodied intelligence. Compared to fixed-based robotic arms, humanoid robots offer a larger operational space while significantly increasing the difficulty of control and planning. Despite the rapid progress towards general-purpose humanoid robots, most studies remain focused on locomotion ability with few investigations into whole-body coordination and tasks planning, thus limiting the potential to demonstrate long-horizon tasks involving both mobility and manipulation under open-ended verbal instructions. In this work, we propose a novel framework that learns, selects, and plans behaviors based on tasks in different scenarios. We combine reinforcement learning (RL) with whole-body optimization to generate robot motions and store them into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems · Robot Manipulation and Learning
