TL;DR
This paper proposes an adversarial training method for variational auto-encoders that enhances image quality by combining reconstruction, distribution matching, and perceptual guidance, resulting in high-fidelity, diverse, high-resolution images.
Contribution
It introduces an auto-encoder based discriminator for adversarial training of VAEs, improving image sharpness and diversity over traditional methods.
Findings
Generated images are high-resolution and diverse.
The model reduces blurriness and over-smoothing.
Training stabilizes through an error feedback mechanism.
Abstract
Variational auto-encoders (VAEs) provide an attractive solution to image generation problem. However, they tend to produce blurred and over-smoothed images due to their dependence on pixel-wise reconstruction loss. This paper introduces a new approach to alleviate this problem in the VAE based generative models. Our model simultaneously learns to match the data, reconstruction loss and the latent distributions of real and fake images to improve the quality of generated samples. To compute the loss distributions, we introduce an auto-encoder based discriminator model which allows an adversarial learning procedure. The discriminator in our model also provides perceptual guidance to the VAE by matching the learned similarity metric of the real and fake samples in the latent space. To stabilize the overall training process, our model uses an error feedback approach to maintain the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsUSD Coin Customer Service Number +1-833-534-1729
