DexSim2Real$^{2}$: Building Explicit World Model for Precise Articulated Object Dexterous Manipulation
Taoran Jiang, Yixuan Guan, Liqian Ma, Jing Xu, Jiaojiao Meng, Weihang Chen, Zecui Zeng, Lusong Li, Dan Wu, Rui Chen

TL;DR
DexSim2Real2 introduces an explicit world model for goal-conditioned articulated object manipulation, enabling precise control without demonstrations or reinforcement learning by combining active interactions, 3D AIGC modeling, and eigengrasp for efficient planning.
Contribution
The paper presents a novel framework that constructs an explicit world model for unseen articulated objects using active interactions and 3D AIGC, facilitating precise manipulation without RL or demonstrations.
Findings
Effective manipulation with suction and gripper tools.
Generalizes to tool-based manipulation strategies.
Validates precise control on various robotic setups.
Abstract
Articulated objects are ubiquitous in daily life. In this paper, we present DexSim2Real, a novel framework for goal-conditioned articulated object manipulation. The core of our framework is constructing an explicit world model of unseen articulated objects through active interactions, which enables sampling-based model predictive control to plan trajectories achieving different goals without requiring demonstrations or RL. It first predicts an interaction using an affordance network trained on self-supervised interaction data or videos of human manipulation. After executing the interactions on the real robot to move the object parts, we propose a novel modeling pipeline based on 3D AIGC to build a digital twin of the object in simulation from multiple frames of observations. For dexterous hands, we utilize eigengrasp to reduce the action dimension, enabling more efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Anomaly Detection Techniques and Applications · Image Processing and 3D Reconstruction
