Natural Reweighted Wake-Sleep
Csongor V\'arady (1), Riccardo Volpi (2), Luigi Malag\`o (2) and, Nihat Ay (1) ((1) Institute for Data Science Foundations, Hamburg University, of Technology, Hamburg, Germany, (2) Transylvanian Institute of Neuroscience,, Cluj-Napoca, Romania)

TL;DR
This paper introduces Natural Reweighted Wake-Sleep (NRWS) and Natural Bidirectional Helmholtz Machine (NBiHM), leveraging the Fisher information matrix's structure for efficient natural gradient training of Helmholtz Machines, leading to improved convergence and likelihood.
Contribution
The paper proposes a novel natural gradient-based training algorithm for Helmholtz Machines that exploits the Fisher matrix's structure for efficiency and improved performance.
Findings
NRWS and NBiHM outperform non-geometric baselines.
Faster convergence and higher log-likelihood values achieved.
Efficient natural gradient computation without approximations.
Abstract
Helmholtz Machines (HMs) are a class of generative models composed of two Sigmoid Belief Networks (SBNs), acting respectively as an encoder and a decoder. These models are commonly trained using a two-step optimization algorithm called Wake-Sleep (WS) and more recently by improved versions, such as Reweighted Wake-Sleep (RWS) and Bidirectional Helmholtz Machines (BiHM). The locality of the connections in an SBN induces sparsity in the Fisher Information Matrices associated to the probabilistic models, in the form of a finely-grained block-diagonal structure. In this paper we exploit this property to efficiently train SBNs and HMs using the natural gradient. We present a novel algorithm, called Natural Reweighted Wake-Sleep (NRWS), that corresponds to the geometric adaptation of its standard version. In a similar manner, we also introduce Natural Bidirectional Helmholtz Machine (NBiHM).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Error Correcting Code Techniques · Advanced Image and Video Retrieval Techniques
