GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao; Yilun Chen; Shuai Yang; Xinyi Chen; Yang Tian; Hao Li; Haifeng Huang; Hanqing Wang; Tai Wang; Jiangmiao Pang

arXiv:2506.10966·cs.RO·June 13, 2025

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao, Yilun Chen, Shuai Yang, Xinyi Chen, Yang Tian, Hao Li, Haifeng Huang, Hanqing Wang, Tai Wang, Jiangmiao Pang

PDF

Open Access

TL;DR

This paper introduces GenManip, a realistic simulation platform and benchmark for evaluating the generalization of instruction-following policies in robotic manipulation, emphasizing the role of foundation models and data scaling.

Contribution

It presents a new simulation platform and benchmark for studying policy generalization, along with an evaluation of modular versus end-to-end policies in diverse scenarios.

Findings

01

Modular systems with foundation models outperform end-to-end policies in generalization.

02

Data scaling benefits end-to-end policies more significantly.

03

GenManip enables systematic evaluation of policy adaptability in realistic settings.

Abstract

Robotic manipulation in real-world settings remains challenging, especially regarding robust generalization. Existing simulation platforms lack sufficient support for exploring how policies adapt to varied instructions and scenarios. Thus, they lag behind the growing interest in instruction-following foundation models like LLMs, whose adaptability is crucial yet remains underexplored in fair comparisons. To bridge this gap, we introduce GenManip, a realistic tabletop simulation platform tailored for policy generalization studies. It features an automatic pipeline via LLM-driven task-oriented scene graph to synthesize large-scale, diverse tasks using 10K annotated 3D object assets. To systematically assess generalization, we present GenManip-Bench, a benchmark of 200 scenarios refined via human-in-the-loop corrections. We evaluate two policy types: (1) modular manipulation systems…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications