Improving parameter learning of Bayesian nets from incomplete data
Giorgio Corani, Cassio P. De Campos

TL;DR
This paper proposes new methods based on maximum entropy and Bayesian model averaging to improve parameter estimation of Bayesian networks from incomplete data, outperforming traditional maximum likelihood approaches.
Contribution
It introduces two novel approaches that enhance Bayesian network parameter learning from incomplete data, addressing overfitting and model uncertainty issues.
Findings
Bayesian model averaging outperforms traditional methods with EM.
Maximum entropy approach matches model averaging when using a non-linear solver.
Proposed methods significantly improve inference accuracy over standard techniques.
Abstract
This paper addresses the estimation of parameters of a Bayesian network from incomplete data. The task is usually tackled by running the Expectation-Maximization (EM) algorithm several times in order to obtain a high log-likelihood estimate. We argue that choosing the maximum log-likelihood estimate (as well as the maximum penalized log-likelihood and the maximum a posteriori estimate) has severe drawbacks, being affected both by overfitting and model uncertainty. Two ideas are discussed to overcome these issues: a maximum entropy approach and a Bayesian model averaging approach. Both ideas can be easily applied on top of EM, while the entropy idea can be also implemented in a more sophisticated way, through a dedicated non-linear solver. A vast set of experiments shows that these ideas produce significantly better estimates and inferences than the traditional and widely used maximum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
