Learning to Generate Novel Scene Compositions from Single Images and   Videos

Vadim Sushko; Juergen Gall; Anna Khoreva

arXiv:2105.05847·cs.CV·May 13, 2021

Learning to Generate Novel Scene Compositions from Single Images and Videos

Vadim Sushko, Juergen Gall, Anna Khoreva

PDF

Open Access 1 Repo

TL;DR

This paper introduces One-Shot GAN, a novel model capable of generating diverse and high-quality scene compositions from just one image or video, addressing low-data training challenges.

Contribution

It proposes a two-branch discriminator architecture that enables learning from a single image or video, improving diversity and quality in scene synthesis.

Findings

01

Achieves higher diversity and quality than previous single-image GANs.

02

Successfully learns from a single video, extending the single-image setting.

03

Generates plausible, novel scene compositions with preserved context.

Abstract

Training GANs in low-data regimes remains a challenge, as overfitting often leads to memorization or training divergence. In this work, we introduce One-Shot GAN that can learn to generate samples from a training set as little as one image or one video. We propose a two-branch discriminator, with content and layout branches designed to judge the internal content separately from the scene layout realism. This allows synthesis of visually plausible, novel compositions of a scene, with varying content and layout, while preserving the context of the original sample. Compared to previous single-image GAN models, One-Shot GAN achieves higher diversity and quality of synthesis. It is also not restricted to the single image setting, successfully learning in the introduced setting of a single video.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boschresearch/one-shot-synthesis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis