VSDiffusion: Taming Ill-Posed Shadow Generation via Visibility-Constrained Diffusion
Jing Li, Jing Zhang

TL;DR
VSDiffusion is a novel two-stage diffusion framework that uses visibility priors to generate realistic, geometrically consistent shadows in complex scenes, significantly improving shadow accuracy and boundary sharpness.
Contribution
The paper introduces VSDiffusion, a new visibility-constrained diffusion method with a two-stage process and innovative priors for improved shadow generation in image composition.
Findings
Achieves state-of-the-art shadow accuracy on DESOBAv2 dataset.
Effectively sharpens shadow boundaries and enhances texture interaction.
Outperforms existing methods across multiple evaluation metrics.
Abstract
Generating realistic cast shadows for inserted foreground objects is a crucial yet challenging problem in image composition, where maintaining geometric consistency of shadow and object in complex scenes remains difficult due to the ill-posed nature of shadow formation. To address this issue, we propose VSDiffusion, a visibility-constrained two-stage framework designed to narrow the solution space by incorporating visibility priors. In Stage I, we predict a coarse shadow mask to localize plausible shadow generated regions. And in Stage II, conditional diffusion is performed guided by lighting and depth cues estimated from the composite to generate accurate shadows. In VSDiffusion, we inject visibility priors through two complementary pathways. First, a visibility control branch with shadow-gated cross attention that provides multi-scale structural guidance. Then, a learned soft prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Image Enhancement Techniques
