Arnold: a generalist muscle transformer policy
Alberto Silvio Chiappa, Boshi An, Merkourios Simos, Chengkun Li, Alexander Mathis

TL;DR
Arnold is a versatile transformer-based policy that learns multiple musculoskeletal control tasks and embodiments, enabling rapid adaptation and providing insights into biological motor control.
Contribution
It introduces Arnold, a generalist muscle transformer policy that combines behavior cloning and PPO, with a novel sensorimotor vocabulary for multi-task, multi-embodiment learning.
Findings
Achieves expert performance in 14 control tasks
Supports rapid adaptation to new tasks
Provides insights into muscle synergy transferability
Abstract
Controlling high-dimensional and nonlinear musculoskeletal models of the human body is a foundational scientific challenge. Recent machine learning breakthroughs have heralded policies that master individual skills like reaching, object manipulation and locomotion in musculoskeletal systems with many degrees of freedom. However, these agents are merely "specialists", achieving high performance for a single skill. In this work, we develop Arnold, a generalist policy that masters multiple tasks and embodiments. Arnold combines behavior cloning and fine-tuning with PPO to achieve expert or super-expert performance in 14 challenging control tasks from dexterous object manipulation to locomotion. A key innovation is Arnold's sensorimotor vocabulary, a compositional representation of the semantics of heterogeneous sensory modalities, objectives, and actuators. Arnold leverages this vocabulary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
