Bias-Variance Trade-Off in Hierarchical Probabilistic Models Using Higher-Order Feature Interactions

Simon Luo; Mahito Sugiyama

arXiv:1906.12063 · stat.ML · July 1, 2019 · 1 citation

TL;DR

This paper investigates the bias-variance trade-off in hierarchical probabilistic models, specifically higher-order Boltzmann machines, using a new inference algorithm and bias-variance decomposition.

Contribution

It introduces an efficient inference method for higher-order Boltzmann machines and analyzes the bias-variance trade-off between hidden layers and higher-order interactions.

Findings

01

Higher-order interactions produce less variance with small sample sizes.

02

Hidden layers and higher-order interactions have comparable errors.

03

The study provides insights into model complexity and generalization.

Abstract

Hierarchical probabilistic models can use a large number of parameters to achieve high representation power. However, it is well known that increasing the number of parameters also increases the complexity of the model, which leads to a bias-variance trade-off. Although this is a classical problem, the bias-variance trade-off between hidden layers and higher-order interactions has not been well studied. In our study, we propose an efficient inference algorithm for the log-linear formulation of the higher-order Boltzmann machine using a combination of Gibbs sampling and annealed importance sampling. We then perform a bias-variance decomposition to study the differences between hidden layers and higher-order interactions. Our results show that hidden layers and higher-order interactions have comparable errors of a similar order of magnitude and using…
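
To make the Gibbs sampling step mentioned in the abstract concrete, here is a minimal sketch for a log-linear model over binary variables with higher-order interaction terms. It is not the authors' algorithm: the parameterization p(x) ∝ exp(Σ_S θ_S Π_{i∈S} x_i), the function name `gibbs_sample`, and the example weights are all assumptions made for illustration, and the annealed importance sampling part is omitted.

```python
import math
import random


def gibbs_sample(theta, n, sweeps, rng):
    """Gibbs sampling for p(x) proportional to exp(sum_S theta[S] * prod_{i in S} x_i).

    theta maps a frozenset of variable indices (an interaction term) to its
    weight; x is a binary vector of length n. This parameterization is an
    assumption for illustration, not taken from the paper.
    """
    x = [rng.randint(0, 1) for _ in range(n)]
    samples = []
    for _ in range(sweeps):
        for i in range(n):
            # Log-odds of x_i = 1 given the rest: sum the weights of every
            # interaction that contains i and whose other members are all 1.
            logit = sum(
                w for s, w in theta.items()
                if i in s and all(x[j] for j in s if j != i)
            )
            p_one = 1.0 / (1.0 + math.exp(-logit))
            x[i] = 1 if rng.random() < p_one else 0
        samples.append(tuple(x))
    return samples


# Hypothetical weights: a first-, second-, and third-order term.
theta = {
    frozenset({0}): 50.0,
    frozenset({1, 2}): 0.5,
    frozenset({0, 1, 2}): -1.0,
}
samples = gibbs_sample(theta, n=3, sweeps=200, rng=random.Random(0))
```

Each sweep resamples every variable from its exact conditional distribution, so the third-order term only contributes to a variable's log-odds when the other two variables in that term are both 1. In practice the early sweeps would be discarded as burn-in before the samples are used to estimate expectations.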

Code & Models

Repositories

sjmluo/HBM (official)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods