Truncated Variational EM for Semi-Supervised Neural Simpletrons
Dennis Forster, J\"org L\"ucke

TL;DR
This paper introduces a novel truncated variational EM method applied to Neural Simpletrons, improving training efficiency, convergence speed, and classification accuracy in semi-supervised learning tasks with large-scale generative neural networks.
Contribution
It combines Neural Simpletrons with TV-EM, providing a scalable, theoretically grounded approach that enhances learning efficiency and performance in semi-supervised neural generative models.
Findings
Fewer EM iterations needed for convergence
Higher likelihood values on benchmarks
Lower error rates in semi-supervised classification
Abstract
Inference and learning for probabilistic generative networks is often very challenging and typically prevents scalability to as large networks as used for deep discriminative approaches. To obtain efficiently trainable, large-scale and well performing generative networks for semi-supervised learning, we here combine two recent developments: a neural network reformulation of hierarchical Poisson mixtures (Neural Simpletrons), and a novel truncated variational EM approach (TV-EM). TV-EM provides theoretical guarantees for learning in generative networks, and its application to Neural Simpletrons results in particularly compact, yet approximately optimal, modifications of learning equations. If applied to standard benchmarks, we empirically find, that learning converges in fewer EM iterations, that the complexity per EM iteration is reduced, and that final likelihood values are higher on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Topic Modeling
