Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning

Joey Hejna; Chethan Bhateja; Yichen Jiang; Karl Pertsch; Dorsa Sadigh

arXiv:2408.14037·cs.RO·February 24, 2025

Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning

Joey Hejna, Chethan Bhateja, Yichen Jiang, Karl Pertsch, Dorsa Sadigh

PDF

Open Access 1 Repo

TL;DR

This paper introduces Re-Mix, a method that optimizes data weighting in large-scale robotics datasets using distributionally robust optimization, significantly improving downstream robot policy performance.

Contribution

Re-Mix is the first approach applying DRO to robotics datasets, addressing variability challenges and enhancing data curation for robot foundation models.

Findings

01

Re-Mix outperforms uniform weighting by 38% in downstream tasks.

02

Re-Mix surpasses human-selected weights by 32%.

03

Data curation critically impacts robot policy performance.

Abstract

Increasingly large imitation learning datasets are being collected with the goal of training foundation models for robotics. However, despite the fact that data selection has been of utmost importance in vision and natural language processing, little work in robotics has questioned what data such models should actually be trained on. In this work we investigate how to weigh different subsets or ``domains'' of robotics datasets for robot foundation model pre-training. Concrete, we use distributionally robust optimization (DRO) to maximize worst-case performance across all possible downstream domains. Our method, Re-Mix, addresses the wide range of challenges that arise when applying DRO to robotics datasets including variability in action spaces and dynamics across different datasets. Re-Mix employs early stopping, action normalization, and discretization to counteract these issues.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jhejna/remix
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification