EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders

Gulcin Baykal; Melih Kandemir; Gozde Unal

arXiv:2310.05718 · cs.CV · July 16, 2024

TL;DR

EdVAE introduces an evidential deep learning approach to address codebook collapse in discrete variational autoencoders, improving reconstruction quality and codebook utilization over traditional softmax-based methods.

Contribution

The paper proposes EdVAE, a novel method that replaces softmax with evidential deep learning to mitigate codebook collapse in dVAEs.

Findings

1. Mitigates codebook collapse effectively
2. Improves reconstruction performance
3. Enhances codebook utilization
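
Codebook utilization can be made concrete with a small sketch. The example below is an illustration under stated assumptions, not the paper's evaluation code: it uses perplexity (the exponential of the entropy of empirical code usage), a common proxy for utilization in VQ-VAE-style models. The function name `codebook_usage` and the toy index lists are hypothetical.

```python
import math
from collections import Counter


def codebook_usage(indices, codebook_size):
    """Return (perplexity, fraction_used) for a list of selected code indices.

    Perplexity is exp(entropy) of the empirical usage distribution. It ranges
    from 1 (a single code is always chosen) up to codebook_size (all codes
    are chosen equally often).
    """
    counts = Counter(indices)
    total = len(indices)
    entropy = 0.0
    for count in counts.values():
        p = count / total
        entropy -= p * math.log(p)
    return math.exp(entropy), len(counts) / codebook_size


# Toy data (hypothetical): a collapsed encoder versus a well-spread one,
# both choosing from a codebook with 8 entries.
collapsed = [0] * 98 + [1, 2]
spread = list(range(8)) * 10

ppl_collapsed, used_collapsed = codebook_usage(collapsed, codebook_size=8)
ppl_spread, used_spread = codebook_usage(spread, codebook_size=8)
```

A perplexity close to 1 signals collapse, since nearly every input maps to the same entry; a perplexity close to the codebook size signals that the entries are used evenly.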

Abstract

Codebook collapse is a common problem in training deep generative models with discrete representation spaces like Vector Quantized Variational Autoencoders (VQ-VAEs). We observe that the same problem arises for the alternatively designed discrete variational autoencoders (dVAEs) whose encoder directly learns a distribution over the codebook embeddings to represent the data. We hypothesize that using the softmax function to obtain a probability distribution causes the codebook collapse by assigning overconfident probabilities to the best matching codebook elements. In this paper, we propose a novel way to incorporate evidential deep learning (EDL) instead of softmax to combat the codebook collapse problem of dVAE. We evidentially monitor the significance of attaining the probability distribution over the codebook embeddings, in contrast to softmax usage. Our experiments using various…
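
To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes the standard EDL recipe, in which non-negative evidence is mapped to Dirichlet concentration parameters (alpha = evidence + 1) and the Dirichlet mean serves as the distribution over codebook entries. The softplus evidence function, the helper names, and the example logits are illustrative assumptions; the paper's exact choices may differ.

```python
import math


def softmax(logits):
    """Standard softmax over a list of logits (max-shifted for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softplus(x):
    """Numerically stable log(1 + exp(x)); used here as the evidence function."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def evidential_probs(logits):
    """Map logits to a Dirichlet and return its mean plus an uncertainty score.

    Assumption: evidence = softplus(logit), alpha = evidence + 1.
    The uncertainty K / sum(alpha) follows the usual EDL convention.
    """
    alpha = [softplus(x) + 1.0 for x in logits]
    strength = sum(alpha)
    probs = [a / strength for a in alpha]
    uncertainty = len(alpha) / strength
    return probs, uncertainty


# Hypothetical encoder logits over a 4-entry codebook.
logits = [4.0, 1.0, 0.0, -1.0]
p_softmax = softmax(logits)
p_evidential, uncertainty = evidential_probs(logits)
```

For these example logits, softmax places most of the probability mass on the best-matching entry, while the Dirichlet mean spreads it more evenly and also yields an explicit uncertainty value.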

Code & Models

Repositories

ituvisionlab/edvae (PyTorch, official)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare

Methods: VQ-VAE · Softmax