A Confident Information First Principle for Parametric Reduction and Model Selection of Boltzmann Machines
Xiaozhao Zhao, Yuexian Hou, Dawei Song, Wenjie Li

TL;DR
This paper introduces the Confident-Information-First (CIF) principle for model-oriented dimensionality reduction in parameter spaces, specifically applied to Boltzmann machines, to improve model selection and density estimation accuracy.
Contribution
It proposes a novel CIF criterion based on Fisher information to identify confident parameters, and applies this to derive and select Boltzmann machine models from a theoretical perspective.
Findings
CIF effectively preserves essential parameters in density estimation.
Sample-specific CIF improves model selection accuracy.
The method enhances density estimation performance in experiments.
Abstract
Typical dimensionality reduction (DR) methods are often data-oriented, focusing on directly reducing the number of random variables (features) while retaining the maximal variations in the high-dimensional data. In unsupervised situations, one of the main limitations of these methods lies in their dependency on the scale of data features. This paper aims to address the problem from a new perspective and considers model-oriented dimensionality reduction in parameter spaces of binary multivariate distributions. Specifically, we propose a general parameter reduction criterion, called Confident-Information-First (CIF) principle, to maximally preserve confident parameters and rule out less confident parameters. Formally, the confidence of each parameter can be assessed by its contribution to the expected Fisher information distance within the geometric manifold over the neighbourhood of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
