Is Diversity All You Need for Scalable Robotic Manipulation?

Modi Shi; Li Chen; Jin Chen; Yuxiang Lu; Chiming Liu; Guanghui Ren; Ping Luo; Di Huang; Maoqing Yao; Hongyang Li

arXiv:2507.06219·cs.RO·July 9, 2025

Is Diversity All You Need for Scalable Robotic Manipulation?

Modi Shi, Li Chen, Jin Chen, Yuxiang Lu, Chiming Liu, Guanghui Ren, Ping Luo, Di Huang, Maoqing Yao, Hongyang Li

PDF

Open Access 1 Repo

TL;DR

This paper investigates the impact of data diversity in robotic manipulation, revealing that task diversity is most critical, cross-embodiment transfer can be efficient with high-quality single-embodiment data, and expert diversity can introduce confounding factors.

Contribution

The study challenges the conventional belief that more diverse data is always better, providing new insights into data scaling principles for robotic manipulation.

Findings

01

Task diversity is more important than demonstration quantity.

02

Single-embodiment pre-training can transfer effectively across platforms.

03

Velocity multimodality from expert diversity can confound policy learning.

Abstract

Data scaling has driven remarkable success in foundation models for Natural Language Processing (NLP) and Computer Vision (CV), yet the principles of effective data scaling in robotic manipulation remain insufficiently understood. In this work, we investigate the nuanced role of data diversity in robot learning by examining three critical dimensions-task (what to do), embodiment (which robot to use), and expert (who demonstrates)-challenging the conventional intuition of "more diverse is better". Throughout extensive experiments on various robot platforms, we reveal that (1) task diversity proves more critical than per-task demonstration quantity, benefiting transfer from diverse pre-training tasks to novel downstream scenarios; (2) multi-embodiment pre-training data is optional for cross-embodiment transfer-models trained on high-quality single-embodiment data can efficiently transfer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

opendrivelab/agibot-world
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Reinforcement Learning in Robotics