Style-Guided Inference of Transformer for High-resolution Image   Synthesis

Jonghwa Yim; Minjae Kim

arXiv:2210.05533·cs.CV·October 12, 2022

Style-Guided Inference of Transformer for High-resolution Image Synthesis

Jonghwa Yim, Minjae Kim

PDF

Open Access 3 Videos

TL;DR

This paper introduces a style-guided method for high-resolution image synthesis using transformers, enabling style control without retraining by re-balancing the prior with style constraints.

Contribution

It proposes a novel style transfer approach that conditions transformer-based image synthesis on style images, improving output style consistency without retraining.

Findings

01

Generated images match input style in most cases

02

Method reduces the need for multiple samples to achieve desired style

03

Style control is achieved without retraining the transformer

Abstract

Transformer is eminently suitable for auto-regressive image synthesis which predicts discrete value from the past values recursively to make up full image. Especially, combined with vector quantised latent representation, the state-of-the-art auto-regressive transformer displays realistic high-resolution images. However, sampling the latent code from discrete probability distribution makes the output unpredictable. Therefore, it requires to generate lots of diverse samples to acquire desired outputs. To alleviate the process of generating lots of samples repetitively, in this article, we propose to take a desired output, a style image, as an additional condition without re-training the transformer. To this end, our method transfers the style to a probability constraint to re-balance the prior, thereby specifying the target distribution instead of the original prior. Thus, generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Style-Guided Inference of Transformer for High-resolution Image Synthesis· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques