URDF-Anything+: Autoregressive Articulated 3D Models Generation for Physical Simulation

Zhuangzhe Wu; Yue Xin; Chengkai Hou; Minghao Chen; Yaoxu Lyu; Jieyu Zhang; Shanghang Zhang

arXiv:2603.14010·cs.RO·March 17, 2026

URDF-Anything+: Autoregressive Articulated 3D Models Generation for Physical Simulation

Zhuangzhe Wu, Yue Xin, Chengkai Hou, Minghao Chen, Yaoxu Lyu, Jieyu Zhang, Shanghang Zhang

PDF

Open Access

TL;DR

This paper introduces URDF-Anything+, an autoregressive framework that generates complete, executable 3D articulated object models from visual data, facilitating realistic simulation and robot transfer without multi-stage processing.

Contribution

It presents a novel end-to-end autoregressive method for directly generating URDF models from images, improving reconstruction quality and enabling real-to-simulation transfer.

Findings

01

Outperforms prior methods in geometric reconstruction quality.

02

Achieves higher joint parameter accuracy.

03

Enables effective real robot policy transfer.

Abstract

Articulated objects are fundamental for robotics, simulation of physics, and interactive virtual environments. However, reconstructing them from visual input remains challenging, as it requires jointly inferring both part geometry and kinematic structure. We present, an end-to-end autoregressive framework that directly generates executable articulated object models from visual observations. Given image and object-level 3D cues, our method sequentially produces part geometries and their associated joint parameters, resulting in complete URDF models without reliance on multi-stage pipelines. The generation proceeds until the model determines that all parts have been produced, automatically inferring complete geometry and kinematics. Building on this capability, we enable a new Real-Follow-Sim paradigm, where high-fidelity digital twins constructed from visual observations allow policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Robot Manipulation and Learning · Human Motion and Animation