Toward Spatially Unbiased Generative Models

Jooyoung Choi; Jungbeom Lee; Yonghyun Jeong; Sungroh Yoon

arXiv:2108.01285·cs.LG·August 4, 2021

Toward Spatially Unbiased Generative Models

Jooyoung Choi, Jungbeom Lee, Yonghyun Jeong, Sungroh Yoon

PDF

2 Repos

TL;DR

This paper identifies spatial bias in image generators caused by implicit positional encoding and proposes injecting explicit positional encoding to create spatially unbiased models, improving robustness across various tasks.

Contribution

It introduces a method to inject explicit positional encoding into generators, reducing spatial bias and enhancing their versatility in multiple image generation tasks.

Findings

01

Reduced spatial bias in generators

02

Improved performance in multi-scale and arbitrary size generation

03

Applicable to diffusion models as well

Abstract

Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We argue that the generators rely on their implicit positional encoding to render spatial content. From our observations, the generator's implicit positional encoding is translation-variant, making the generator spatially biased. To address this issue, we propose injecting explicit positional encoding at each scale of the generator. By learning the spatially unbiased generator, we facilitate the robust use of generators in multiple tasks, such as GAN inversion, multi-scale generation, generation of arbitrary sizes and aspect ratios. Furthermore, we show that our method can also be applied to denoising diffusion probabilistic models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion