Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang, Maneesh Agrawala

TL;DR
This paper introduces LayerDiffuse, a method to generate transparent images using pretrained latent diffusion models by learning a latent transparency, enabling high-quality transparent layer generation with minimal model modifications.
Contribution
It proposes a novel approach to encode transparency into the latent space of diffusion models, allowing conversion of existing models into transparent image generators through finetuning.
Findings
Achieved 97% user preference for generated transparent images over previous methods.
Demonstrated compatibility with various open source diffusion models.
Generated transparent images comparable in quality to commercial assets.
Abstract
We present LayerDiffuse, an approach enabling large-scale pretrained latent diffusion models to generate transparent images. The method allows generation of single transparent images or of multiple transparent layers. The method learns a "latent transparency" that encodes alpha channel transparency into the latent manifold of a pretrained latent diffusion model. It preserves the production-ready quality of the large diffusion model by regulating the added transparency as a latent offset with minimal changes to the original latent distribution of the pretrained model. In this way, any latent diffusion model can be converted into a transparent image generator by finetuning it with the adjusted latent space. We train the model with 1M transparent image layer pairs collected using a human-in-the-loop collection scheme. We show that latent transparency can be applied to different open source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Data Compression Techniques
MethodsDiffusion · Latent Diffusion Model
