Auto-regressive Image Synthesis with Integrated Quantization
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui,, Changgong Zhang, Shijian Lu

TL;DR
This paper introduces a novel framework for conditional image synthesis that combines integrated feature quantization, variational regularization, and Gumbel sampling to enhance diversity and fidelity in generated images.
Contribution
It proposes an integrated quantization scheme with a variational regularizer and a Gumbel sampling strategy, improving auto-regressive image generation performance.
Findings
Outperforms state-of-the-art in diverse image generation
Achieves higher fidelity and diversity in generated images
Effectively mitigates exposure bias during inference
Abstract
Deep generative models have achieved conspicuous progress in realistic image synthesis with multifarious conditional inputs, while generating diverse yet high-fidelity images remains a grand challenge in conditional image generation. This paper presents a versatile framework for conditional image generation which incorporates the inductive bias of CNNs and powerful sequence modeling of auto-regression that naturally leads to diverse image generation. Instead of independently quantizing the features of multiple domains as in prior research, we design an integrated quantization scheme with a variational regularizer that mingles the feature discretization in multiple domains, and markedly boosts the auto-regressive modeling performance. Notably, the variational regularizer enables to regularize feature distributions in incomparable latent spaces by penalizing the intra-domain variations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
