PetsGAN: Rethinking Priors for Single Image Generation
Zicheng Zhang, Yinglu Liu, Congying Han, Hailin Shi, Tiande Guo, Bowen, Zhou

TL;DR
PetsGAN introduces a regularized latent variable model and an end-to-end training approach for single image generation, significantly improving quality, diversity, and training efficiency over previous methods like SinGAN.
Contribution
The paper proposes a novel regularized latent variable model and PetsGAN, an end-to-end trainable GAN that overcomes SinGAN's limitations in quality and training time.
Findings
PetsGAN outperforms SinGAN in image quality and diversity.
The method reduces training time significantly.
Effective in various image manipulation tasks.
Abstract
Single image generation (SIG), described as generating diverse samples that have similar visual content with the given single image, is first introduced by SinGAN which builds a pyramid of GANs to progressively learn the internal patch distribution of the single image. It also shows great potentials in a wide range of image manipulation tasks. However, the paradigm of SinGAN has limitations in terms of generation quality and training time. Firstly, due to the lack of high-level information, SinGAN cannot handle the object images well as it does on the scene and texture images. Secondly, the separate progressive training scheme is time-consuming and easy to cause artifact accumulation. To tackle these problems, in this paper, we dig into the SIG problem and improve SinGAN by fully-utilization of internal and external priors. The main contributions of this paper include: 1) We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsNormalizing Flows · Sliced Iterative Generator
