DailyArt: Discovering Articulation from Single Static Images via Latent Dynamics

Hang Zhang; Qijian Tian; Jingyu Gong; Daoguo Dong; Xuhong Wang; Yuan Xie; Xin Tan

arXiv:2604.07758 · cs.CV · April 10, 2026

TL;DR

DailyArt infers articulated joints from a single static image by first synthesizing an opened state under the same camera view to reveal articulation cues, which enables joint estimation without multi-view data or explicit part priors.

Contribution

DailyArt formulates joint estimation as a synthesis-mediated reasoning task, allowing simultaneous recovery of all joints from a single image without object-specific templates or multi-view inputs.

Findings

01. Achieves strong performance in articulated joint estimation.

02. Supports part-level novel state synthesis conditioned on joints.

03. Operates without explicit part annotations at test time.

Abstract

Articulated objects are essential for embodied AI and world models, yet inferring their kinematics from a single closed-state image remains challenging because crucial motion cues are often occluded. Existing methods either require multi-state observations or rely on explicit part priors, retrieval, or other auxiliary inputs that partially expose the structure to be inferred. In this work, we present DailyArt, which formulates articulated joint estimation from a single static image as a synthesis-mediated reasoning problem. Instead of directly regressing joints from a heavily occluded observation, DailyArt first synthesizes a maximally articulated opened state under the same camera view to expose articulation cues, and then estimates the full set of joint parameters from the discrepancy between the observed and synthesized states. Using a set-prediction formulation, DailyArt recovers…
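The abstract mentions a set-prediction formulation for recovering the full set of joints. As a rough illustration only, the sketch below shows how an unordered set of predicted joints might be matched one-to-one against reference joints before computing a loss or a metric. Everything in it is an assumption made for illustration: the `Joint` fields, the cost terms in `pair_cost` and their equal weighting, and the brute-force search in `match_joints` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from itertools import permutations
import math


@dataclass(frozen=True)
class Joint:
    """Hypothetical joint parameterization (not from the paper)."""
    kind: str      # "revolute" or "prismatic"
    axis: tuple    # 3D direction of the joint axis
    origin: tuple  # 3D point on the joint axis


def axis_angle(a, b):
    """Angle in radians between two axes, ignoring their sign."""
    dot = abs(sum(x * y for x, y in zip(a, b)))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return math.acos(min(1.0, dot / norm))


def pair_cost(pred, ref):
    """Assumed matching cost: type mismatch + axis angle + origin distance."""
    type_cost = 0.0 if pred.kind == ref.kind else 1.0
    return type_cost + axis_angle(pred.axis, ref.axis) + math.dist(pred.origin, ref.origin)


def match_joints(preds, refs):
    """Brute-force minimum-cost one-to-one matching of references to predictions.

    Returns (pairs, total_cost), where pairs is a list of (pred_index, ref_index).
    Only practical for the handful of joints a single object typically has.
    """
    if len(preds) < len(refs):
        raise ValueError("need at least as many predictions as reference joints")
    best_pairs, best_cost = None, math.inf
    for perm in permutations(range(len(preds)), len(refs)):
        cost = sum(pair_cost(preds[p], refs[r]) for r, p in enumerate(perm))
        if cost < best_cost:
            best_pairs, best_cost = [(p, r) for r, p in enumerate(perm)], cost
    return best_pairs, best_cost


# Usage: two predicted joints, one reference joint (illustrative values).
preds = [
    Joint("prismatic", (1.0, 0.0, 0.0), (0.0, 0.0, 0.0)),
    Joint("revolute", (0.0, 0.0, 1.0), (0.5, 0.0, 0.0)),
]
refs = [Joint("revolute", (0.0, 0.0, -1.0), (0.5, 0.0, 0.0))]
pairs, cost = match_joints(preds, refs)
print(pairs, cost)
```

The point of matching before scoring is that a set prediction has no canonical order, so each reference joint must first be assigned to its closest prediction. In practice an assignment solver such as the Hungarian algorithm would replace the brute-force loop, and the cost terms would need units and weights suited to the data.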

Code & Models

Repositories

https://rangooo123.github.io/DaliyArt.github.io
github
