MotionGlot: A Multi-Embodied Motion Generation Model

Sudarshan Harithas; Srinath Sridhar

arXiv:2410.16623·cs.RO·May 2, 2025

MotionGlot: A Multi-Embodied Motion Generation Model

Sudarshan Harithas, Srinath Sridhar

PDF

Open Access

TL;DR

MotionGlot is a versatile motion generation model that adapts language model training principles to produce diverse motions across different embodiments, validated through multiple tasks, datasets, and real-world experiments.

Contribution

We introduce MotionGlot, a novel multi-embodied motion generation model using instruction-tuning inspired by large language models, along with two new motion datasets.

Findings

01

35.3% average improvement across tasks

02

Successful adaptation of LLM training principles to motion generation

03

Validated in real-world hardware experiments

Abstract

This paper introduces MotionGlot, a model that can generate motion across multiple embodiments with different action dimensions, such as quadruped robots and human bodies. By leveraging the well-established training procedures commonly used in large language models (LLMs), we introduce an instruction-tuning template specifically designed for motionrelated tasks. Our approach demonstrates that the principles underlying LLM training can be successfully adapted to learn a wide range of motion generation tasks across multiple embodiments with different action dimensions. We demonstrate the various abilities of MotionGlot on a set of 6 tasks and report an average improvement of 35.3% across tasks. Additionally, we contribute two new datasets: (1) a dataset of expert-controlled quadruped locomotion with approximately 48,000 trajectories paired with direction-based text annotations, and (2) a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis