Distributional Principal Autoencoders
Xinwei Shen, Nicolai Meinshausen

TL;DR
Distributional Principal Autoencoders (DPA) aim to achieve dimension reduction while preserving the original data distribution, ensuring reconstructed data are identically distributed as the original, regardless of the reduced dimension.
Contribution
The paper introduces DPA, a novel autoencoder that matches the conditional data distribution given latent variables, enabling distribution-preserving dimension reduction.
Findings
DPA successfully reconstructs original data distribution across climate, single-cell, and image datasets.
DPA embeddings preserve meaningful data structures like seasonal cycles and cell types.
Numerical results demonstrate DPA's practical feasibility and effectiveness.
Abstract
Dimension reduction techniques usually lose information in the sense that reconstructed data are not identical to the original data. However, we argue that it is possible to have reconstructed data identically distributed as the original data, irrespective of the retained dimension or the specific mapping. This can be achieved by learning a distributional model that matches the conditional distribution of data given its low-dimensional latent variables. Motivated by this, we propose Distributional Principal Autoencoder (DPA) that consists of an encoder that maps high-dimensional data to low-dimensional latent variables and a decoder that maps the latent variables back to the data space. For reducing the dimension, the DPA encoder aims to minimise the unexplained variability of the data with an adaptive choice of the latent dimension. For reconstructing data, the DPA decoder aims to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
