A theoretical perspective on mode collapse in variational inference
Roman Soletskyi, Marylou Gabri\'e, Bruno Loureiro

TL;DR
This paper provides a theoretical analysis of mode collapse in variational inference, revealing key mechanisms and offering insights into its occurrence even in favorable scenarios, especially in the context of normalizing flows.
Contribution
It introduces a theoretical framework for understanding mode collapse in VI via gradient flow on Gaussian mixture models, identifying mechanisms like mean alignment and vanishing weights.
Findings
Mode collapse occurs even in statistically favorable scenarios.
Two mechanisms driving collapse are mean alignment and vanishing weights.
Theoretical insights align with practical implementations using normalizing flows.
Abstract
While deep learning has expanded the possibilities for highly expressive variational families, the practical benefits of these tools for variational inference (VI) are often limited by the minimization of the traditional Kullback-Leibler objective, which can yield suboptimal solutions. A major challenge in this context is \emph{mode collapse}: the phenomenon where a model concentrates on a few modes of the target distribution during training, despite being statistically capable of expressing them all. In this work, we carry a theoretical investigation of mode collapse for the gradient flow on Gaussian mixture models. We identify the key low-dimensional statistics characterizing the flow, and derive a closed set of low-dimensional equations governing their evolution. Leveraging this compact description, we show that mode collapse is present even in statistically favorable scenarios, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsYersinia bacterium, plague, ectoparasites research · Spectroscopy Techniques in Biomedical and Chemical Research · Image Processing Techniques and Applications
MethodsVariational Inference · Sparse Evolutionary Training
