ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes

Yixuan Yang; Luyang Xie; Zhen Luo; Zixiang Zhao; Tongsheng Ding; Mingqi Gao; Feng Zheng

arXiv:2511.12977·cs.CV·November 19, 2025

ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes

Yixuan Yang, Luyang Xie, Zhen Luo, Zixiang Zhao, Tongsheng Ding, Mingqi Gao, Feng Zheng

PDF

Open Access

TL;DR

ArtiWorld is a pipeline that automatically identifies and converts rigid 3D objects into articulated models using LLMs and point cloud data, enabling scalable creation of interactive simulation assets.

Contribution

The paper introduces ArtiWorld, a novel scene-aware pipeline that leverages LLMs and 3D data to automate the conversion of rigid objects into articulated models, reducing manual effort.

Findings

01

Outperforms existing methods across multiple evaluation levels.

02

Achieves state-of-the-art accuracy in generating URDF models.

03

Preserves original object geometry and interactivity.

Abstract

Building interactive simulators and scalable robot-learning environments requires a large number of articulated assets. However, most existing 3D assets in simulation are rigid, and manually converting them into articulated objects is extremely labor- and cost-intensive. This raises a natural question: can we automatically identify articulable objects in a scene and convert them into articulated assets directly? In this paper, we present ArtiWorld, a scene-aware pipeline that localizes candidate articulable objects from textual scene descriptions and reconstructs executable URDF models that preserve the original geometry. At the core of this pipeline is Arti4URDF, which leverages 3D point cloud, prior knowledge of a large language model (LLM), and a URDF-oriented prompt design to rapidly convert rigid objects into interactive URDF-based articulated objects while maintaining their 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis