MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas, Srinath Sridhar

TL;DR
MotionGlot is a versatile motion generation model that adapts language model training principles to produce diverse motions across different embodiments, validated through multiple tasks, datasets, and real-world experiments.
Contribution
We introduce MotionGlot, a novel multi-embodied motion generation model using instruction-tuning inspired by large language models, along with two new motion datasets.
Findings
35.3% average improvement across tasks
Successful adaptation of LLM training principles to motion generation
Validated in real-world hardware experiments
Abstract
This paper introduces MotionGlot, a model that can generate motion across multiple embodiments with different action dimensions, such as quadruped robots and human bodies. By leveraging the well-established training procedures commonly used in large language models (LLMs), we introduce an instruction-tuning template specifically designed for motionrelated tasks. Our approach demonstrates that the principles underlying LLM training can be successfully adapted to learn a wide range of motion generation tasks across multiple embodiments with different action dimensions. We demonstrate the various abilities of MotionGlot on a set of 6 tasks and report an average improvement of 35.3% across tasks. Additionally, we contribute two new datasets: (1) a dataset of expert-controlled quadruped locomotion with approximately 48,000 trajectories paired with direction-based text annotations, and (2) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
