A Complete Decomposition of KL Error using Refined Information and Mode Interaction Selection
James Enouen, Mahito Sugiyama

TL;DR
This paper introduces a new information geometric approach to decompose KL error in log-linear models, emphasizing higher-order interactions, and proposes an algorithm for sparse mode selection that improves data efficiency.
Contribution
It presents a complete KL error decomposition focusing on higher-order modes and develops MAHGenTa, a novel algorithm for sparse interaction selection in energy-based models.
Findings
MAHGenTa effectively maximizes log-likelihood on synthetic and real datasets.
The approach improves data efficiency by selecting relevant mode interactions.
The method adapts well to both generative and discriminative tasks.
Abstract
The log-linear model has received a significant amount of theoretical attention in previous decades and remains the fundamental tool used for learning probability distributions over discrete variables. Despite its large popularity in statistical mechanics and high-dimensional statistics, the majority of related energy-based models only focus on the two-variable relationships, such as Boltzmann machines and Markov graphical models. Although these approaches have easier-to-solve structure learning problems and easier-to-optimize parametric distributions, they often ignore the rich structure which exists in the higher-order interactions between different variables. Using more recent tools from the field of information geometry, we revisit the classical formulation of the log-linear model with a focus on higher-order mode interactions, going beyond the 1-body modes of independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
