LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents

Rui Li; Zixuan Hu; Wenxi Qu; Jinouwen Zhang; Zhenfei Yin; Sha Zhang; Xuantuo Huang; Hanqing Wang; Tai Wang; Jiangmiao Pang; Wanli Ouyang; Lei Bai; Wangmeng Zuo; Ling-Yu Duan; Dongzhan Zhou; Shixiang Tang

arXiv:2505.22634·cs.RO·December 9, 2025

LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents

Rui Li, Zixuan Hu, Wenxi Qu, Jinouwen Zhang, Zhenfei Yin, Sha Zhang, Xuantuo Huang, Hanqing Wang, Tai Wang, Jiangmiao Pang, Wanli Ouyang, Lei Bai, Wangmeng Zuo, Ling-Yu Duan, Dongzhan Zhou, Shixiang Tang

PDF

Open Access

TL;DR

LabUtopia introduces a high-fidelity simulation platform and hierarchical benchmark tailored for developing and evaluating embodied agents in complex laboratory environments, addressing a key gap in scientific automation research.

Contribution

It presents LabUtopia, combining a detailed simulator, procedural scene generator, and multi-level benchmark to advance embodied intelligence in scientific settings.

Findings

01

Supports 30 tasks with diverse assets

02

Enables large-scale training and evaluation

03

Facilitates research on perception, planning, and control

Abstract

Scientific embodied agents play a crucial role in modern laboratories by automating complex experimental workflows. Compared to typical household environments, laboratory settings impose significantly higher demands on perception of physical-chemical transformations and long-horizon planning, making them an ideal testbed for advancing embodied intelligence. However, its development has been long hampered by the lack of suitable simulator and benchmarks. In this paper, we address this gap by introducing LabUtopia, a comprehensive simulation and benchmarking suite designed to facilitate the development of generalizable, reasoning-capable embodied agents in laboratory settings. Specifically, it integrates i) LabSim, a high-fidelity simulator supporting multi-physics and chemically meaningful interactions; ii) LabScene, a scalable procedural generator for diverse scientific scenes; and iii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Action Observation and Synchronization · Multimodal Machine Learning Applications