InSeGAN: A Generative Approach to Segmenting Identical Instances in   Depth Images

Anoop Cherian; Goncalo Dias Pais; Siddarth Jain; Tim K. Marks; and Alan Sullivan

arXiv:2108.13865·cs.CV·January 31, 2022

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, and Alan Sullivan

PDF

Open Access

TL;DR

InSeGAN is an unsupervised 3D GAN that synthesizes and segments multiple nearly identical object instances in depth images, enabling effective instance segmentation without labeled data.

Contribution

The paper introduces a novel GAN architecture with an analysis-by-synthesis approach for unsupervised segmentation of identical objects in depth images.

Findings

01

Achieves state-of-the-art segmentation performance on synthetic and real data.

02

Outperforms prior methods by large margins.

03

Introduces a new synthetic dataset, Insta-10.

Abstract

In this paper, we present InSeGAN, an unsupervised 3D generative adversarial network (GAN) for segmenting (nearly) identical instances of rigid objects in depth images. Using an analysis-by-synthesis approach, we design a novel GAN architecture to synthesize a multiple-instance depth image with independent control over each instance. InSeGAN takes in a set of code vectors (e.g., random noise vectors), each encoding the 3D pose of an object that is represented by a learned implicit object template. The generator has two distinct modules. The first module, the instance feature generator, uses each encoded pose to transform the implicit template into a feature map representation of each object instance. The second module, the depth image renderer, aggregates all of the single-instance feature maps output by the first module and generates a multiple-instance depth image. A discriminator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Image and Object Detection Techniques