A Two-round Variant of EM for Gaussian Mixtures
Sanjoy Dasgupta, Leonard Schulman

TL;DR
This paper evaluates different model selection criteria for supervised Bayesian network tasks, finding that the Dawids prequential approach outperforms the traditional marginal likelihood score in predictive accuracy.
Contribution
The study empirically compares various supervised model selection methods, highlighting the superiority of Dawids prequential over marginal likelihood in classification tasks.
Findings
Marginal likelihood performs poorly in supervised model selection.
Dawids prequential approach yields the best predictive performance.
Empirical results are based on numerous publicly available datasets.
Abstract
Given a set of possible models (e.g., Bayesian network structures) and a data sample, in the unsupervised model selection problem the task is to choose the most accurate model with respect to the domain joint probability distribution. In contrast to this, in supervised model selection it is a priori known that the chosen model will be used in the future for prediction tasks involving more ``focused' predictive distributions. Although focused predictive distributions can be produced from the joint probability distribution by marginalization, in practice the best model in the unsupervised sense does not necessarily perform well in supervised domains. In particular, the standard marginal likelihood score is a criterion for the unsupervised task, and, although frequently used for supervised model selection also, does not perform well in such tasks. In this paper we study the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Algorithms and Data Compression · Statistical Methods and Inference
