TL;DR
This paper introduces pixel2style2pixel (pSp), a versatile encoder framework that embeds real images into StyleGAN's latent space for efficient, multi-modal image-to-image translation without adversarial training.
Contribution
The paper presents a novel encoder that directly maps images into StyleGAN's extended latent space, enabling flexible and simplified image translation tasks.
Findings
Encoder can embed real images into W+ without optimization
Framework supports multi-modal synthesis through style resampling
Effective on various facial translation tasks and extendable beyond faces
Abstract
We present a generic image-to-image translation framework, pixel2style2pixel (pSp). Our pSp framework is based on a novel encoder network that directly generates a series of style vectors which are fed into a pretrained StyleGAN generator, forming the extended W+ latent space. We first show that our encoder can directly embed real images into W+, with no additional optimization. Next, we propose utilizing our encoder to directly solve image-to-image translation tasks, defining them as encoding problems from some input domain into the latent domain. By deviating from the standard invert first, edit later methodology used with previous StyleGAN encoders, our approach can handle a variety of tasks even when the input image is not represented in the StyleGAN domain. We show that solving translation tasks through StyleGAN significantly simplifies the training process, as no adversary is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Bottleneck Residual Block · Kaiming Initialization · Max Pooling · Average Pooling · Global Average Pooling · Residual Connection · Batch Normalization
