SHOT-VAE: Semi-supervised Deep Generative Models With Label-aware ELBO Approximations
Hao-Zhe Feng, Kezhi Kong, Minghao Chen, Tianye Zhang, Minfeng Zhu, Wei, Chen

TL;DR
This paper introduces SHOT-VAE, a semi-supervised VAE that improves inference accuracy by integrating label information into ELBO and overcoming the ELBO bottleneck, leading to state-of-the-art results on CIFAR datasets.
Contribution
SHOT-VAE proposes a new ELBO approximation called smooth-ELBO and an optimal interpolation method to enhance semi-supervised VAE performance without extra prior knowledge.
Findings
Achieves 25.30% error on CIFAR-100 with 10k labels
Reduces error to 6.11% on CIFAR-10 with 4k labels
Effectively addresses ELBO bottleneck and label utilization issues
Abstract
Semi-supervised variational autoencoders (VAEs) have obtained strong results, but have also encountered the challenge that good ELBO values do not always imply accurate inference results. In this paper, we investigate and propose two causes of this problem: (1) The ELBO objective cannot utilize the label information directly. (2) A bottleneck value exists and continuing to optimize ELBO after this value will not improve inference accuracy. On the basis of the experiment results, we propose SHOT-VAE to address these problems without introducing additional prior knowledge. The SHOT-VAE offers two contributions: (1) A new ELBO approximation named smooth-ELBO that integrates the label predictive loss into ELBO. (2) An approximation based on optimal interpolation that breaks the ELBO value bottleneck by reducing the margin between ELBO and the data likelihood. The SHOT-VAE achieves good…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Cancer-related molecular mechanisms research · Domain Adaptation and Few-Shot Learning
MethodsMixup · Stochastic Gradient Variational Bayes
