Collapse Resistant Deep Convolutional GAN for Multi-Object Image   Generation

Elijah D. Bolluyt; Cristina Comaniciu

arXiv:1911.02996·cs.LG·November 11, 2019

Collapse Resistant Deep Convolutional GAN for Multi-Object Image Generation

Elijah D. Bolluyt, Cristina Comaniciu

PDF

TL;DR

This paper presents a new conditional deep convolutional GAN architecture capable of generating multi-object images with diverse classes, realistic arrangements, and resistance to mode collapse, advancing multi-object image synthesis.

Contribution

The work introduces a novel, collapse-resistant conditional DCGAN architecture that learns to generate complex multi-object images with realistic spatial relationships.

Findings

01

Successfully generates multi-object images with diverse class combinations

02

Demonstrates stability against mode collapse during training

03

Produces images with realistic object arrangements and occlusions

Abstract

This work introduces a novel system for the generation of images that contain multiple classes of objects. Recent work in Generative Adversarial Networks have produced high quality images, but many focus on generating images of a single object or set of objects. Our system addresses the task of image generation conditioned on a list of desired classes to be included in a single image. This enables our system to generate images with any given combination of objects, all composed into a visually realistic natural image. The system learns the interrelationships of all classes represented in a dataset, and can generate diverse samples including a set of these classes. It displays the ability to arrange these objects together, accounting for occlusions and inter-object spatial relations that characterize complex natural images. To accomplish this, we introduce a novel architecture based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.