PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun,, Chao Chen, Rongrong Ji

TL;DR
PixelFolder introduces a progressive pixel synthesis network that significantly reduces computational and parameter costs in image generation while achieving state-of-the-art results on benchmark datasets.
Contribution
It proposes a novel multi-stage pixel regression framework with pixel folding operations for efficient image synthesis, outperforming existing methods in speed and accuracy.
Findings
Reduces 89% computation compared to CIPS.
Achieves state-of-the-art FID scores on FFHQ and LSUN datasets.
More efficient than StyleGAN2 with 72% less computation.
Abstract
Pixel synthesis is a promising research paradigm for image generation, which can well exploit pixel-wise prior knowledge for generation. However, existing methods still suffer from excessive memory footprint and computation overhead. In this paper, we propose a progressive pixel synthesis network towards efficient image generation, coined as PixelFolder. Specifically, PixelFolder formulates image generation as a progressive pixel regression problem and synthesizes images via a multi-stage structure, which can greatly reduce the overhead caused by large tensor transformations. In addition, we introduce novel pixel folding operations to further improve model efficiency while maintaining pixel-wise prior knowledge for end-to-end regression. With these innovative designs, we greatly reduce the expenditure of pixel synthesis, e.g., reducing 89% computation and 53% parameters compared with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsR1 Regularization · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Path Length Regularization · Weight Demodulation
