ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes
Yixuan Yang, Luyang Xie, Zhen Luo, Zixiang Zhao, Tongsheng Ding, Mingqi Gao, Feng Zheng

TL;DR
ArtiWorld is a pipeline that automatically identifies and converts rigid 3D objects into articulated models using LLMs and point cloud data, enabling scalable creation of interactive simulation assets.
Contribution
The paper introduces ArtiWorld, a novel scene-aware pipeline that leverages LLMs and 3D data to automate the conversion of rigid objects into articulated models, reducing manual effort.
Findings
Outperforms existing methods across multiple evaluation levels.
Achieves state-of-the-art accuracy in generating URDF models.
Preserves original object geometry and interactivity.
Abstract
Building interactive simulators and scalable robot-learning environments requires a large number of articulated assets. However, most existing 3D assets in simulation are rigid, and manually converting them into articulated objects is extremely labor- and cost-intensive. This raises a natural question: can we automatically identify articulable objects in a scene and convert them into articulated assets directly? In this paper, we present ArtiWorld, a scene-aware pipeline that localizes candidate articulable objects from textual scene descriptions and reconstructs executable URDF models that preserve the original geometry. At the core of this pipeline is Arti4URDF, which leverages 3D point clouds, the prior knowledge of a large language model (LLM), and a URDF-oriented prompt design to rapidly convert rigid objects into interactive URDF-based articulated objects while maintaining their 3D…
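To make the target format concrete, the following is a minimal sketch of what an "executable URDF model that preserves the original geometry" might look like for a single movable part. It is not the authors' Arti4URDF implementation: the function name `build_urdf`, the link and joint names, the mesh file names, and the joint limits are all hypothetical, and the joint parameters that the pipeline would infer are simply passed in by hand here.

```python
import xml.etree.ElementTree as ET


def build_urdf(name, base_mesh, part_mesh, joint_type="revolute",
               axis=(0.0, 0.0, 1.0), origin=(0.0, 0.0, 0.0),
               lower=0.0, upper=1.57):
    """Return a URDF string with a base link and one movable part.

    Hypothetical helper for illustration only; in a real pipeline the
    joint type, axis, origin, and limits would be predicted, not given.
    """
    robot = ET.Element("robot", name=name)

    # Each link points at its original mesh file, so geometry is untouched.
    for link_name, mesh in (("base", base_mesh), ("part", part_mesh)):
        link = ET.SubElement(robot, "link", name=link_name)
        visual = ET.SubElement(link, "visual")
        geometry = ET.SubElement(visual, "geometry")
        ET.SubElement(geometry, "mesh", filename=mesh)

    # A single joint connects the movable part to the base.
    joint = ET.SubElement(robot, "joint", name="base_to_part", type=joint_type)
    ET.SubElement(joint, "parent", link="base")
    ET.SubElement(joint, "child", link="part")
    ET.SubElement(joint, "origin", xyz=" ".join(map(str, origin)), rpy="0 0 0")
    ET.SubElement(joint, "axis", xyz=" ".join(map(str, axis)))
    # Revolute and prismatic joints need limits (placeholder values here).
    ET.SubElement(joint, "limit", lower=str(lower), upper=str(upper),
                  effort="10.0", velocity="1.0")

    return ET.tostring(robot, encoding="unicode")


# Example: a cabinet body with one hinged door (file names are made up).
urdf_text = build_urdf("cabinet", "cabinet_body.obj", "cabinet_door.obj")
```

The resulting string can be written to a `.urdf` file and loaded by a simulator that accepts URDF, provided the referenced mesh files exist alongside it.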
Taxonomy
Topics: 3D Shape Modeling and Analysis · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis
