AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation

Mingyang Li; Haofan Xu; Haowen Sun; Xinzhe Chen; Sihua Ren; Liqi Huang; Xinyang Sui; Chenyang Miao; Jiawei Ye; Qiongjie Cui; Zeyang Liu; Xingyu Chen; Xuguang Lan

arXiv:2604.11674·cs.RO·May 12, 2026

AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation

Mingyang Li, Haofan Xu, Haowen Sun, Xinzhe Chen, Sihua Ren, Liqi Huang, Xinyang Sui, Chenyang Miao, Jiawei Ye, Qiongjie Cui, Zeyang Liu, Xingyu Chen, Xuguang Lan

PDF

TL;DR

AffordSim is a scalable simulation framework that generates affordance-aware data for robotic manipulation, improving task success and enabling zero-shot transfer to real robots.

Contribution

It introduces a novel data generator integrating open-vocabulary 3D affordance prediction for scalable, task-relevant robotic manipulation simulation.

Findings

01

Achieves 93% success rate compared to manual annotations on critical tasks.

02

Attains 89% success on complex composite tasks.

03

Zero-shot policies trained on AffordSim data reach 24% success on real robots.

Abstract

Many everyday robot manipulation skills are affordance-dependent, with success determined by whether the robot contacts the functional object region required by the subsequent action. Current simulation data generators obtain contacts from generic grasp estimators or per-object manual contact annotations, but generic estimators rank stable grasps without task semantics and often select contacts that are misaligned with the downstream action, while manual contact annotations must be rewritten for each new object and task. To solve these challenges, we introduce AffordSim, a scalable data generator and benchmark that integrates open-vocabulary 3D affordance prediction into simulation-based trajectory generation. Given a natural-language task description, AffordSim synthesizes a task-relevant scene, emits affordance queries, grounds them on object surfaces, samples region-conditioned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.