DepGAN: Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition
Amr Ghoneim, Jiju Poovvancheri, Yasushi Akiyama, Dong Chen

TL;DR
DepGAN is a novel GAN that leverages depth maps and alpha channels to improve occlusion handling and transparency effects in image composition, outperforming existing methods on benchmark datasets.
Contribution
We introduce DepGAN, a GAN utilizing depth maps and a new Depth Aware Loss to better handle occlusions and transparency in image composition.
Findings
DepGAN outperforms state-of-the-art methods in accuracy of object placement.
It effectively manages transparency and occlusion handling.
The model shows significant improvements both visually and quantitatively.
Abstract
Image composition is a complex task which requires a lot of information about the scene for an accurate and realistic composition, such as perspective, lighting, shadows, occlusions, and object interactions. Previous methods have predominantly used 2D information for image composition, neglecting the potentials of 3D spatial information. In this work, we propose DepGAN, a Generative Adversarial Network that utilizes depth maps and alpha channels to rectify inaccurate occlusions and enhance transparency effects in image composition. Central to our network is a novel loss function called Depth Aware Loss which quantifies the pixel wise depth difference to accurately delineate occlusion boundaries while compositing objects at different depth levels. Furthermore, we enhance our network's learning process by utilizing opacity data, enabling it to effectively manage compositions involving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction
MethodsAttentive Walk-Aggregating Graph Neural Network
