InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, and Alan Sullivan

TL;DR
InSeGAN is an unsupervised 3D GAN that synthesizes and segments multiple nearly identical object instances in depth images, enabling effective instance segmentation without labeled data.
Contribution
The paper introduces a novel GAN architecture with an analysis-by-synthesis approach for unsupervised segmentation of identical objects in depth images.
Findings
Achieves state-of-the-art segmentation performance on synthetic and real data.
Outperforms prior methods by large margins.
Introduces a new synthetic dataset, Insta-10.
Abstract
In this paper, we present InSeGAN, an unsupervised 3D generative adversarial network (GAN) for segmenting (nearly) identical instances of rigid objects in depth images. Using an analysis-by-synthesis approach, we design a novel GAN architecture to synthesize a multiple-instance depth image with independent control over each instance. InSeGAN takes in a set of code vectors (e.g., random noise vectors), each encoding the 3D pose of an object that is represented by a learned implicit object template. The generator has two distinct modules. The first module, the instance feature generator, uses each encoded pose to transform the implicit template into a feature map representation of each object instance. The second module, the depth image renderer, aggregates all of the single-instance feature maps output by the first module and generates a multiple-instance depth image. A discriminator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Image and Object Detection Techniques
