Simplex Autoencoders
Aymene Mohammed Bouayed, David Naccache

TL;DR
This paper introduces a novel simplex-based modeling approach for Autoencoder latent spaces, enabling efficient synthetic data generation with improved FID scores on benchmark datasets, especially MNIST.
Contribution
It proposes modeling Autoencoder latent spaces as a simplex with a new heuristic for mixture component determination and a probability mass function sampling method.
Findings
Achieves state-of-the-art FID on MNIST with 4.29
Significantly improves FID on CIFAR-10 and Celeba compared to previous Autoencoder results
Outperforms existing Autoencoder methods on MNIST by a notable margin
Abstract
Synthetic data generation is increasingly important due to privacy concerns. While Autoencoder-based approaches have been widely used for this purpose, sampling from their latent spaces can be challenging. Mixture models are currently the most efficient way to sample from these spaces. In this work, we propose a new approach that models the latent space of an Autoencoder as a simplex, allowing for a novel heuristic for determining the number of components in the mixture model. This heuristic is independent of the number of classes and produces comparable results. We also introduce a sampling method based on probability mass functions, taking advantage of the compactness of the latent space. We evaluate our approaches on a synthetic dataset and demonstrate their performance on three benchmark datasets: MNIST, CIFAR-10, and Celeba. Our approach achieves an image generation FID of 4.29,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · AI in cancer detection · Bayesian Methods and Mixture Models
MethodsAutoencoders
