Beta-Sigma VAE: Separating beta and decoder variance in Gaussian variational autoencoder
Seunghwan Kim, Seungkyu Lee

TL;DR
This paper introduces Beta-Sigma VAE, a novel model that explicitly separates beta and decoder variance to improve image synthesis quality and controllability, addressing the blurriness issue in traditional VAEs.
Contribution
The paper proposes Beta-Sigma VAE, which explicitly disentangles beta and decoder variance, enabling better analysis, controllability, and performance in generative modeling.
Findings
Beta-Sigma VAE outperforms conventional VAE in image synthesis.
Explicit separation of beta and variance improves model controllability.
Analysis of rate-distortion curves validates the effectiveness of the approach.
Abstract
Variational autoencoder (VAE) is an established generative model but is notorious for its blurriness. In this work, we investigate the blurry output problem of VAE and resolve it, exploiting the variance of Gaussian decoder and of beta-VAE. Specifically, we reveal that the indistinguishability of decoder variance and hinders appropriate analysis of the model by random likelihood value, and limits performance improvement by omitting the gain from . To address the problem, we propose Beta-Sigma VAE (BS-VAE) that explicitly separates and decoder variance in the model. Our method demonstrates not only superior performance in natural image synthesis but also controllable parameters and predictable analysis compared to conventional VAE. In our experimental evaluation, we employ the analysis of rate-distortion curve and proxy metrics on computer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
