Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models

Ruixing Jin; Zicheng Zhu; Ruixiang Ouyang; Sheng Xu; Bo Yue; Zhizheng Wu; Guiliang Liu

arXiv:2603.22876·cs.RO·March 25, 2026

Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models

Ruixing Jin, Zicheng Zhu, Ruixiang Ouyang, Sheng Xu, Bo Yue, Zhizheng Wu, Guiliang Liu

PDF

Open Access

TL;DR

This paper empirically investigates the key factors affecting the transfer of dexterous manipulation policies from simulation to real-world, focusing on Vision-Language-Action models, and provides a standardized evaluation protocol and benchmark.

Contribution

It offers a comprehensive empirical analysis of Sim-to-Real transfer factors for VLA models and introduces a standardized evaluation protocol and benchmark for dexterous manipulation.

Findings

01

Multi-level domain randomization impacts transfer performance.

02

Photorealistic rendering improves real-world task success.

03

Physics-realistic modeling enhances policy robustness.

Abstract

Learning a generalist control policy for dexterous manipulation typically relies on large-scale datasets. Given the high cost of real-world data collection, a practical alternative is to generate synthetic data through simulation. However, the resulting synthetic data often exhibits a significant gap from real-world distributions. While many prior studies have proposed algorithms to bridge the Sim-to-Real discrepancy, there remains a lack of principled research that grounds these methods in real-world manipulation tasks, particularly their performance on generalist policies such as Vision-Language-Action (VLA) models. In this study, we empirically examine the primary determinants of Sim-to-Real generalization across four dimensions: multi-level domain randomization, photorealistic rendering, physics-realistic modeling, and reinforcement learning updates. To support this study, we design…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning