ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
Haowen Wang, Xiaoping Yuan, Fugang Zhang, Rui Jian, Yuanwei Zhu, Xiuquan Qiao, Yakun Huang

TL;DR
ArtGen is a novel diffusion-based framework that generates articulated 3D objects from images or text, ensuring accurate geometry and kinematic consistency, advancing capabilities in robotics and digital twins.
Contribution
It introduces a conditional diffusion model with cross-state sampling, reasoning modules, and a compositional 3D-VAE to improve articulated object generation from limited inputs.
Findings
Outperforms state-of-the-art on PartNet-Mobility
Achieves accurate geometry and kinematic consistency
Handles arbitrary part-level states effectively
Abstract
Generating articulated assets is crucial for robotics, digital twins, and embodied intelligence. Existing generative models often rely on single-view inputs representing closed states, resulting in ambiguous or unrealistic kinematic structures due to the entanglement between geometric shape and joint dynamics. To address these challenges, we introduce ArtGen, a conditional diffusion-based framework capable of generating articulated 3D objects with accurate geometry and coherent kinematics from single-view images or text descriptions at arbitrary part-level states. Specifically, ArtGen employs cross-state Monte Carlo sampling to explicitly enforce global kinematic consistency, reducing structural-motion entanglement. Additionally, we integrate a Chain-of-Thought reasoning module to infer robust structural priors, such as part semantics, joint types, and connectivity, guiding a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Robot Manipulation and Learning
