ArtLLM: Generating Articulated Assets via 3D LLM

Penghao Wang; Siyuan Xie; Hongyu Yan; Xianghui Yang; Jingwei Huang; Chunchao Guo; Jiayuan Gu

arXiv:2603.01142·cs.CV·April 1, 2026

ArtLLM: Generating Articulated Assets via 3D LLM

Penghao Wang, Siyuan Xie, Hongyu Yan, Xianghui Yang, Jingwei Huang, Chunchao Guo, Jiayuan Gu

PDF

TL;DR

ArtLLM is a new framework that generates detailed articulated 3D objects from complete meshes, improving layout accuracy and joint prediction over existing methods, with applications in digital twins and robot learning.

Contribution

Introduces ArtLLM, a 3D multimodal large language model that predicts parts and joints from point clouds, enabling high-quality articulated asset generation from complete meshes.

Findings

01

Outperforms state-of-the-art in part layout accuracy

02

Achieves superior joint prediction results

03

Generalizes well to real-world objects

Abstract

Creating interactive digital environments for gaming, robotics, and simulation relies on articulated 3D objects whose functionality emerges from their part geometry and kinematic structure. However, existing approaches remain fundamentally limited: optimization-based reconstruction methods require slow, per-object joint fitting and typically handle only simple, single-joint objects, while retrieval-based methods assemble parts from a fixed library, leading to repetitive geometry and poor generalization. To address these challenges, we introduce ArtLLM, a novel framework for generating high-quality articulated assets directly from complete 3D meshes. At its core is a 3D multimodal large language model trained on a large-scale articulation dataset curated from both existing articulation datasets and procedurally generated objects. Unlike prior work, ArtLLM autoregressively predicts a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.