DreamArt: Generating Interactable Articulated Objects from a Single Image

Ruijie Lu; Yu Liu; Jiaxiang Tang; Junfeng Ni; Yuxiang Wang; Diwen Wan; Gang Zeng; Yixin Chen; Siyuan Huang

arXiv:2507.05763·cs.CV·July 9, 2025

DreamArt: Generating Interactable Articulated Objects from a Single Image

Ruijie Lu, Yu Liu, Jiaxiang Tang, Junfeng Ni, Yuxiang Wang, Diwen Wan, Gang Zeng, Yixin Chen, Siyuan Huang

PDF

Open Access

TL;DR

DreamArt is a new framework that creates high-quality, interactable articulated 3D objects from a single image, combining segmentation, articulation modeling, and texture refinement for scalable asset generation.

Contribution

It introduces a three-stage pipeline for generating articulated 3D objects from single images, including part segmentation, articulation prior learning, and texture optimization, which is novel in this domain.

Findings

01

Produces high-fidelity, articulated 3D objects from single images

02

Accurately models part shapes and plausible articulation

03

Ensures coherent textures across object parts

Abstract

Generating articulated objects, such as laptops and microwaves, is a crucial yet challenging task with extensive applications in Embodied AI and AR/VR. Current image-to-3D methods primarily focus on surface geometry and texture, neglecting part decomposition and articulation modeling. Meanwhile, neural reconstruction approaches (e.g., NeRF or Gaussian Splatting) rely on dense multi-view or interaction data, limiting their scalability. In this paper, we introduce DreamArt, a novel framework for generating high-fidelity, interactable articulated assets from single-view images. DreamArt employs a three-stage pipeline: firstly, it reconstructs part-segmented and complete 3D object meshes through a combination of image-to-3D generation, mask-prompted 3D segmentation, and part amodal completion. Second, we fine-tune a video diffusion model to capture part-level articulation priors, leveraging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Robot Manipulation and Learning

MethodsDiffusion · Focus