A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies

Yu Lei; Minghuan Liu; Abhiram Maddukuri; Zhenyu Jiang; Yuke Zhu

arXiv:2604.13645·cs.RO·April 16, 2026

TL;DR

This paper provides a theoretical and empirical analysis of sim-and-real co-training for generative robot policies, identifying key effects that influence its success and proposing a method to improve it.

Contribution

It identifies two intrinsic effects, structured representation alignment and importance reweighting, that govern co-training performance, and offers a simple method to improve existing co-training approaches.

Findings

01

Structured representation alignment, a balance between cross-domain representation alignment and domain discernibility, plays the primary role in downstream performance.

02

Importance reweighting arises from domain-dependent modulation of action weighting and plays a secondary role.

03

The proposed method outperforms prior approaches in experiments.

Abstract

Co-training, which combines limited in-domain real-world data with abundant surrogate data such as simulation or cross-embodiment robot data, is widely used for training generative robot policies. Despite its empirical success, the mechanisms that determine when and why co-training is effective remain poorly understood. We investigate the mechanism of sim-and-real co-training through theoretical analysis and empirical study, and identify two intrinsic effects governing performance. The first, "structured representation alignment", reflects a balance between cross-domain representation alignment and domain discernibility, and plays a primary role in downstream performance. The second, the "importance reweighting effect", arises from domain-dependent modulation of action weighting and operates at a secondary level. We validate these effects with controlled experiments…
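To make the co-training setup in the abstract concrete, here is a minimal sketch of how a mixed training batch might be drawn from a small real dataset and a large simulated one. It is not the authors' method: the function name `sample_cotraining_batch`, the `sim_ratio` parameter, and the placeholder demo names are all assumptions made for illustration, and the paper's actual mixing scheme may differ.

```python
import random


def sample_cotraining_batch(real_data, sim_data, batch_size, sim_ratio, rng):
    """Draw one mixed batch; about `sim_ratio` of it comes from simulation.

    Hypothetical helper for illustration only, not taken from the paper.
    """
    if not 0.0 <= sim_ratio <= 1.0:
        raise ValueError("sim_ratio must be in [0, 1]")
    n_sim = round(batch_size * sim_ratio)
    n_real = batch_size - n_sim
    # Sample with replacement: the real dataset is small, so its
    # demonstrations are revisited far more often than simulated ones.
    batch = [("real", rng.choice(real_data)) for _ in range(n_real)]
    batch += [("sim", rng.choice(sim_data)) for _ in range(n_sim)]
    rng.shuffle(batch)
    return batch


rng = random.Random(0)
real_data = [f"real_demo_{i}" for i in range(5)]    # limited in-domain data
sim_data = [f"sim_demo_{i}" for i in range(500)]    # abundant surrogate data
batch = sample_cotraining_batch(real_data, sim_data, batch_size=8,
                                sim_ratio=0.75, rng=rng)
n_sim = sum(1 for domain, _ in batch if domain == "sim")
print(n_sim, len(batch) - n_sim)  # prints "6 2"
```

Each sample keeps a domain tag next to it, purely so that the split between real and simulated data can be inspected after mixing.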
