Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Ria Doshi, Homer Walke, Oier Mees, Sudeep Dasari, Sergey Levine

TL;DR
This paper introduces CrossFormer, a transformer-based policy trained on diverse robot data, enabling a single model to control various robot types and outperform specialized policies in real-world tasks.
Contribution
The paper presents CrossFormer, a scalable transformer policy that handles multi-robot data without manual space alignment, trained on the largest diverse dataset for cross-embodiment control.
Findings
CrossFormer controls diverse robots with a single network.
It matches specialized policies' performance in real-world tests.
It outperforms previous cross-embodiment learning methods.
Abstract
Modern machine learning systems rely on large datasets to attain broad generalization, and this often poses a challenge in robot learning, where each robotic platform and task might have only a small dataset. By training a single policy across many different kinds of robots, a robot learning method can leverage much broader and more diverse datasets, which in turn can lead to better generalization and robustness. However, training a single policy on multi-robot data is challenging because robots can have widely varying sensors, actuators, and control frequencies. We propose CrossFormer, a scalable and flexible transformer-based policy that can consume data from any embodiment. We train CrossFormer on the largest and most diverse dataset to date, 900K trajectories across 20 different robot embodiments. We demonstrate that the same network weights can control vastly different robots,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeography and Education Methods · Human Motion and Animation · Educational Tools and Methods
