Consistent Model Selection of Discrete Bayesian Networks from Incomplete   Data

Nikolay H. Balov

arXiv:1105.4507·math.ST·April 18, 2013

Consistent Model Selection of Discrete Bayesian Networks from Incomplete Data

Nikolay H. Balov

PDF

TL;DR

This paper proposes a new method for selecting discrete Bayesian network models from incomplete data using a modified likelihood approach and analyzes its consistency, showing that standard BIC may not be reliable in this context.

Contribution

It introduces a consistent model selection procedure for discrete Bayesian networks with incomplete data, replacing the standard likelihood with node-average likelihood and analyzing BIC's limitations.

Findings

01

The proposed method is consistent when the penalty parameter decreases slower than n^{-1/2}.

02

Standard BIC is generally inconsistent for incomplete data Bayesian network selection.

03

Numerical examples confirm theoretical results.

Abstract

A maximum likelihood based model selection of discrete Bayesian networks is considered. The model selection is performed through scoring function $S$ , which, for a given network $G$ and $n$ -sample $D_{n}$ , is defined to be the maximum log-likelihood $l$ minus a penalization term $λ_{n} h$ proportional to network complexity $h (G)$ , $S (G ∣ D_{n}) = l (G ∣ D_{n}) - λ_{n} h (G) .$ The data is allowed to have missing values at random that has prompted, to improve the efficiency of estimation, a replacement of the standard log-likelihood with the sum of sample average node log-likelihoods. The latter avoids the exclusion of most partially missing data records and allows the comparison of models fitted to different samples. Provided that a discrete Bayesian network is identifiable for a given missing data distribution, we show that if the sequence $λ_{n}$ converges to zero at a slower…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.