Consistent Model Selection of Discrete Bayesian Networks from Incomplete Data
Nikolay H. Balov

TL;DR
This paper proposes a new method for selecting discrete Bayesian network models from incomplete data using a modified likelihood approach and analyzes its consistency, showing that standard BIC may not be reliable in this context.
Contribution
It introduces a consistent model selection procedure for discrete Bayesian networks with incomplete data, replacing the standard likelihood with node-average likelihood and analyzing BIC's limitations.
Findings
The proposed method is consistent when the penalty parameter decreases slower than n^{-1/2}.
Standard BIC is generally inconsistent for incomplete data Bayesian network selection.
Numerical examples confirm theoretical results.
Abstract
A maximum likelihood based model selection of discrete Bayesian networks is considered. The model selection is performed through scoring function , which, for a given network and -sample , is defined to be the maximum log-likelihood minus a penalization term proportional to network complexity , The data is allowed to have missing values at random that has prompted, to improve the efficiency of estimation, a replacement of the standard log-likelihood with the sum of sample average node log-likelihoods. The latter avoids the exclusion of most partially missing data records and allows the comparison of models fitted to different samples. Provided that a discrete Bayesian network is identifiable for a given missing data distribution, we show that if the sequence converges to zero at a slower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
