Rethinking Probabilistic Circuit Parameter Learning
Anji Liu, Zilei Shao, Guy Van den Broeck

TL;DR
This paper introduces a novel mini-batch EM algorithm called anemone for probabilistic circuits, addressing training challenges and improving convergence speed and performance on various datasets.
Contribution
The paper establishes a theoretical connection between mini-batch algorithms and EM, and proposes anemone, an adaptive mini-batch EM method that outperforms existing optimizers.
Findings
Anemone outperforms existing methods in convergence speed.
Anemone achieves better final likelihood on diverse datasets.
Theoretical analysis explains the limitations of current mini-batch EM methods.
Abstract
Probabilistic Circuits (PCs) offer a computationally scalable framework for generative modeling, supporting exact and efficient inference of a wide range of probabilistic queries. While recent advances have significantly improved the expressiveness and scalability of PCs, effectively training their parameters remains a challenge. In particular, a widely used optimization method, full-batch Expectation-Maximization (EM), requires processing the entire dataset before performing a single update, making it ineffective for large datasets. Although empirical extensions to the mini-batch setting, as well as gradient-based mini-batch algorithms, converge faster than full-batch EM, they generally underperform in terms of final likelihood. We investigate this gap by establishing a novel theoretical connection between these practical algorithms and the general EM objective. Our analysis reveals a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and Analog Circuit Testing · Low-power high-performance VLSI design · VLSI and FPGA Design Techniques
