Beta-Sigma VAE: Separating beta and decoder variance in Gaussian   variational autoencoder

Seunghwan Kim; Seungkyu Lee

arXiv:2409.09361·cs.LG·September 17, 2024

Beta-Sigma VAE: Separating beta and decoder variance in Gaussian variational autoencoder

Seunghwan Kim, Seungkyu Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces Beta-Sigma VAE, a novel model that explicitly separates beta and decoder variance to improve image synthesis quality and controllability, addressing the blurriness issue in traditional VAEs.

Contribution

The paper proposes Beta-Sigma VAE, which explicitly disentangles beta and decoder variance, enabling better analysis, controllability, and performance in generative modeling.

Findings

01

Beta-Sigma VAE outperforms conventional VAE in image synthesis.

02

Explicit separation of beta and variance improves model controllability.

03

Analysis of rate-distortion curves validates the effectiveness of the approach.

Abstract

Variational autoencoder (VAE) is an established generative model but is notorious for its blurriness. In this work, we investigate the blurry output problem of VAE and resolve it, exploiting the variance of Gaussian decoder and $β$ of beta-VAE. Specifically, we reveal that the indistinguishability of decoder variance and $β$ hinders appropriate analysis of the model by random likelihood value, and limits performance improvement by omitting the gain from $β$ . To address the problem, we propose Beta-Sigma VAE (BS-VAE) that explicitly separates $β$ and decoder variance $σ_{x}^{2}$ in the model. Our method demonstrates not only superior performance in natural image synthesis but also controllable parameters and predictable analysis compared to conventional VAE. In our experimental evaluation, we employ the analysis of rate-distortion curve and proxy metrics on computer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

overnap/bs-vae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications