Authoring image decompositions with generative models

Jason Rock; Theerasit Issaranon; Aditya Deshpande; David Forsyth

arXiv:1612.01479·cs.CV·December 6, 2016

TL;DR

This paper introduces a method for authoring layered image decompositions with generative models. Each layer gets its own generative model, learned from proxy examples, and the layers need not admit a physical interpretation.

Contribution

It develops a novel convolutional VAE architecture and a framework for creating layered image decompositions guided by proxy examples.

Findings

01 The conv-VAE can reconstruct images at high fidelity.

02 The method generates image layers, one from each generative model, that together explain the input image.

03 It extends intrinsic image decomposition to further layers above albedo and shading.

Abstract

We show how to extend traditional intrinsic image decompositions to incorporate further layers above albedo and shading. It is hard to obtain data to learn a multi-layer decomposition. Instead, we can learn to decompose an image into layers that are "like this" by authoring generative models for each layer using proxy examples that capture the Platonic ideal (Mondrian images for albedo; rendered 3D primitives for shading; material swatches for shading detail). Our method then generates image layers, one from each model, that explain the image. Our approach rests on innovation in generative models for images. We introduce a Convolutional Variational Auto Encoder (conv-VAE), a novel VAE architecture that can reconstruct high fidelity images. The approach is general, and does not require that layers admit a physical interpretation.
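
The idea that a set of layers "explains" an image can be illustrated with a small sketch. The abstract does not state how the layers are combined, so the pixelwise product below is an assumption borrowed from the classic intrinsic image model (image = albedo × shading), extended here to a third "shading detail" layer. The function names `compose_layers` and `reconstruction_error` are hypothetical and are not taken from the paper, and the random arrays merely stand in for layers that the generative models would produce.

```python
import numpy as np


def compose_layers(layers):
    """Combine layers by pixelwise product (assumed composition rule)."""
    image = np.ones_like(layers[0], dtype=np.float64)
    for layer in layers:
        image = image * layer
    return image


def reconstruction_error(image, layers):
    """Mean squared error between the image and the composed layers."""
    return float(np.mean((image - compose_layers(layers)) ** 2))


# Stand-in layers; in the paper each would come from its own generative model.
rng = np.random.default_rng(0)
albedo = rng.uniform(0.2, 1.0, size=(4, 4))
shading = rng.uniform(0.2, 1.0, size=(4, 4))
detail = rng.uniform(0.8, 1.0, size=(4, 4))

# An image built from the three layers is explained by them almost exactly.
image = albedo * shading * detail
error_all = reconstruction_error(image, [albedo, shading, detail])

# Dropping the detail layer leaves a residual that albedo and shading miss.
error_two = reconstruction_error(image, [albedo, shading])
```

Under this assumed model, a decomposition is good when the reconstruction error is small while each layer still looks like a sample from its own generative model. The sketch covers only the first of those two conditions.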

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
