TL;DR
This paper introduces a spherical VAE framework with cluster-aware constraints that guarantees prevention of posterior collapse, ensuring informative latent representations without restricting decoder capacity.
Contribution
It provides a novel theoretical approach leveraging spherical geometry and cluster constraints to strictly prevent posterior collapse in VAEs.
Findings
Achieves 100% collapse prevention on synthetic and real datasets.
Matches or exceeds state-of-the-art reconstruction quality.
Requires no explicit stability conditions or architecture restrictions.
Abstract
Variational autoencoders (VAEs) frequently suffer from posterior collapse, where the latent variables become uninformative as the approximate posterior degenerates to the prior. While recent work has characterized collapse as a phase transition determined by data covariance properties, existing approaches primarily aim to avoid rather than eliminate collapse. We introduce a novel framework that theoretically guarantees non-collapsed solutions by leveraging spherical shell geometry and cluster-aware constraints. Our method transforms data to a spherical shell, computes optimal cluster assignments via K-means, and defines a feasible region between the within-cluster variance and collapse loss . We prove that when the reconstruction loss is constrained to this region, the collapsed solution is mathematically excluded from the feasible parameter space.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Bayesian Methods and Mixture Models · Face recognition and analysis
