Spatial Broadcast Decoder: A Simple Architecture for Learning Disentangled Representations in VAEs
Nicholas Watters, Loic Matthey, Christopher P. Burgess, Alexander Lerchner

TL;DR
The paper introduces the Spatial Broadcast decoder, a simple convolutional architecture for VAEs that improves disentanglement, reconstruction accuracy, and generalization without explicit supervision. The benefit is largest on datasets with small objects.
Contribution
It proposes a novel architectural prior using spatial broadcasting in VAEs, improving disentanglement and generalization without additional supervision.
Findings
Improves disentangling and reconstruction accuracy.
Enhances generalization to held-out regions in data space.
Complementary to state-of-the-art disentangling methods.
Abstract
We present a simple neural rendering architecture that helps variational autoencoders (VAEs) learn disentangled representations. Instead of the deconvolutional network typically used in the decoder of VAEs, we tile (broadcast) the latent vector across space, concatenate fixed X- and Y-"coordinate" channels, and apply a fully convolutional network with 1x1 stride. This provides an architectural prior for dissociating positional from non-positional features in the latent distribution of VAEs, yet without providing any explicit supervision to this effect. We show that this architecture, which we term the Spatial Broadcast decoder, improves disentangling, reconstruction accuracy, and generalization to held-out regions in data space. It provides a particularly dramatic benefit when applied to datasets with small objects. We also emphasize a method for visualizing learned latent spaces that…
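The broadcast step described in the abstract can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the function name `spatial_broadcast`, the channels-last layout, and the [-1, 1] range of the coordinate channels are assumptions made for this example, and the stride-1 convolutional network that would consume the result is omitted.

```python
import numpy as np

def spatial_broadcast(z, height, width):
    """Tile latent vectors across space and append X/Y coordinate channels.

    z: array of shape (batch, latent_dim)
    returns: array of shape (batch, height, width, latent_dim + 2)
    """
    batch, latent_dim = z.shape
    # Tile (broadcast) each latent vector to every spatial location.
    z_tiled = np.broadcast_to(
        z[:, None, None, :], (batch, height, width, latent_dim)
    )
    # Fixed coordinate channels; the [-1, 1] range is an assumption of this sketch.
    ys = np.linspace(-1.0, 1.0, height)
    xs = np.linspace(-1.0, 1.0, width)
    y_grid, x_grid = np.meshgrid(ys, xs, indexing="ij")  # each (height, width)
    coords = np.stack([x_grid, y_grid], axis=-1)         # (height, width, 2)
    coords = np.broadcast_to(coords[None], (batch, height, width, 2))
    # Concatenate latent channels with the X and Y channels.
    return np.concatenate([z_tiled, coords], axis=-1)

# Hypothetical sizes: 4 samples, 10 latent dimensions, 64x64 output grid.
z = np.random.default_rng(0).normal(size=(4, 10))
features = spatial_broadcast(z, height=64, width=64)
print(features.shape)  # (4, 64, 64, 12)
```

Every spatial location receives the same latent vector, so the coordinate channels are the only position-dependent input to the convolutional layers that follow. That is the architectural prior the abstract refers to.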
Taxonomy
Topics: Digital Media Forensic Detection · Chaos-based Image/Signal Encryption · Network Security and Intrusion Detection
Methods: Spatial Broadcast Decoder
