On Supervised Selection of Bayesian Networks
Petri Kontkanen, Petri Myllymaki, Tomi Silander, Henry Tirri

TL;DR
This paper evaluates various model selection criteria for Bayesian networks in supervised tasks, finding that the Dawids prequential approach outperforms the traditional marginal likelihood score in predictive accuracy.
Contribution
It provides an empirical comparison of supervised model selection methods, highlighting the limitations of marginal likelihood and the effectiveness of Dawids prequential criterion.
Findings
Marginal likelihood performs poorly in supervised Bayesian network selection.
Dawids prequential approach yields the best predictive performance.
Empirical results are based on numerous publicly available classification datasets.
Abstract
Given a set of possible models (e.g., Bayesian network structures) and a data sample, in the unsupervised model selection problem the task is to choose the most accurate model with respect to the domain joint probability distribution. In contrast to this, in supervised model selection it is a priori known that the chosen model will be used in the future for prediction tasks involving more ``focused' predictive distributions. Although focused predictive distributions can be produced from the joint probability distribution by marginalization, in practice the best model in the unsupervised sense does not necessarily perform well in supervised domains. In particular, the standard marginal likelihood score is a criterion for the unsupervised task, and, although frequently used for supervised model selection also, does not perform well in such tasks. In this paper we study the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Fault Detection and Control Systems · Machine Learning and Data Classification
