Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs

Anran Qi; Changjian Li; Adrien Bousseau; Niloy J.Mitra

arXiv:2512.13392·cs.CV·December 17, 2025

Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs

Anran Qi, Changjian Li, Adrien Bousseau, Niloy J.Mitra

PDF

Open Access

TL;DR

This paper introduces a novel, training-free image-to-video generation method that allows explicit user control over disoccluded regions by separating motion specification from appearance synthesis using a Proxy Dynamic Graph and diffusion prior.

Contribution

It proposes a lightweight, user-editable Proxy Dynamic Graph for deterministic motion control combined with a diffusion prior for appearance synthesis, enabling controllable video generation without fine-tuning.

Findings

01

Outperforms state-of-the-art in controllable articulated object videos

02

Allows user editing of disoccluded regions in generated videos

03

Enables predictable motion and appearance control in image-to-video tasks

Abstract

We address image-to-video generation with explicit user control over the final frame's disoccluded regions. Current image-to-video pipelines produce plausible motion but struggle to generate predictable, articulated motions while enforcing user-specified content in newly revealed areas. Our key idea is to separate motion specification from appearance synthesis: we introduce a lightweight, user-editable Proxy Dynamic Graph (PDG) that deterministically yet approximately drives part motion, while a frozen diffusion prior is used to synthesize plausible appearance that follows that motion. In our training-free pipeline, the user loosely annotates and reposes a PDG, from which we compute a dense motion flow to leverage diffusion as a motion-guided shader. We then let the user edit appearance in the disoccluded areas of the image, and exploit the visibility information encoded by the PDG to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face recognition and analysis