Self-supervised Matting-specific Portrait Enhancement and Generation
Yangyang Xu Zeyang Zhou, Shengfeng He

TL;DR
This paper introduces a novel approach that enhances portrait images in the latent space of StyleGAN to improve alpha matting accuracy and generate pseudo ground truth data, addressing annotation costs.
Contribution
It proposes a method to refine portraits for better matting by optimizing latent vectors, and leverages StyleGAN's generative capabilities to produce training data, improving matting performance.
Findings
Boosts automatic alpha matting performance significantly.
Refines real portrait images for compatibility with existing matting models.
Generates pseudo ground truth data to reduce annotation costs.
Abstract
We resolve the ill-posed alpha matting problem from a completely different perspective. Given an input portrait image, instead of estimating the corresponding alpha matte, we focus on the other end, to subtly enhance this input so that the alpha matte can be easily estimated by any existing matting models. This is accomplished by exploring the latent space of GAN models. It is demonstrated that interpretable directions can be found in the latent space and they correspond to semantic image transformations. We further explore this property in alpha matting. Particularly, we invert an input portrait into the latent code of StyleGAN, and our aim is to discover whether there is an enhanced version in the latent space which is more compatible with a reference matting model. We optimize multi-scale latent vectors in the latent spaces under four tailored losses, ensuring matting-specificity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging
MethodsHuMan(Expedia)||How do I get a human at Expedia? · StyleGAN · Dense Connections · Convolution · Feedforward Network · Adaptive Instance Normalization · R1 Regularization
