Explicitly disentangling image content from translation and rotation   with spatial-VAE

Tristan Bepler; Ellen D. Zhong; Kotaro Kelley; Edward Brignole; and; Bonnie Berger

arXiv:1909.11663·cs.CV·September 27, 2019·34 cites

Explicitly disentangling image content from translation and rotation with spatial-VAE

Tristan Bepler, Ellen D. Zhong, Kotaro Kelley, Edward Brignole, and, Bonnie Berger

PDF

Open Access 1 Repo

TL;DR

This paper introduces spatial-VAE, a novel variational autoencoder framework that explicitly disentangles image content from pose variables like rotation and translation, leading to improved reconstruction and interpretability.

Contribution

The paper proposes a new method for explicitly disentangling pose from content in VAEs by formulating the generative model as a function of spatial coordinates, enabling better control over latent factors.

Findings

01

Spatial-VAE effectively disentangles rotation and translation from content.

02

The method improves reconstruction quality over standard VAEs.

03

Applications include modeling 2-D views of proteins and galaxies.

Abstract

Given an image dataset, we are often interested in finding data generative factors that encode semantic content independently from pose variables such as rotation and translation. However, current disentanglement approaches do not impose any specific structure on the learned latent representations. We propose a method for explicitly disentangling image rotation and translation from other unstructured latent factors in a variational autoencoder (VAE) framework. By formulating the generative model as a function of the spatial coordinate, we make the reconstruction error differentiable with respect to latent translation and rotation parameters. This formulation allows us to train a neural network to perform approximate inference on these latent variables while explicitly constraining them to only represent rotation and translation. We demonstrate that this framework, termed spatial-VAE,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tbepler/spatial-VAE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Electron Microscopy Techniques and Applications · Computational Physics and Python Applications · Cell Image Analysis Techniques

MethodsSolana Customer Service Number +1-833-534-1729