Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

Youguang Xing; Xu Luo; Junlin Xie; Lianli Gao; Hengtao Shen; Jingkuan Song

arXiv:2508.06426·cs.RO·August 11, 2025

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

Youguang Xing, Xu Luo, Junlin Xie, Lianli Gao, Hengtao Shen, Jingkuan Song

PDF

Open Access

TL;DR

This paper investigates how dataset diversity and fragmentation contribute to shortcut learning in generalist robot policies, revealing that limited diversity and distributional disparities hinder generalization, and proposes data augmentation strategies to mitigate this issue.

Contribution

It identifies dataset fragmentation and limited diversity as key causes of shortcut learning and offers data augmentation methods to improve policy generalization.

Findings

01

Dataset fragmentation leads to reliance on task-irrelevant features.

02

Data augmentation reduces shortcut learning and improves generalization.

03

Strategies are effective in both simulation and real-world environments.

Abstract

Generalist robot policies trained on large-scale datasets such as Open X-Embodiment (OXE) demonstrate strong performance across a wide range of tasks. However, they often struggle to generalize beyond the distribution of their training data. In this paper, we investigate the underlying cause of this limited generalization capability. We identify shortcut learning -- the reliance on task-irrelevant features -- as a key impediment to generalization. Through comprehensive theoretical and empirical analysis, we uncover two primary contributors to shortcut learning: (1) limited diversity within individual sub-datasets, and (2) significant distributional disparities across sub-datasets, leading to dataset fragmentation. These issues arise from the inherent structure of large-scale datasets like OXE, which are typically composed of multiple sub-datasets collected independently across varied…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics