A Two-round Variant of EM for Gaussian Mixtures

Sanjoy Dasgupta; Leonard Schulman

arXiv:1301.3850·cs.LG·January 18, 2013·144 cites

A Two-round Variant of EM for Gaussian Mixtures

Sanjoy Dasgupta, Leonard Schulman

PDF

Open Access

TL;DR

This paper evaluates different model selection criteria for supervised Bayesian network tasks, finding that the Dawids prequential approach outperforms the traditional marginal likelihood score in predictive accuracy.

Contribution

The study empirically compares various supervised model selection methods, highlighting the superiority of Dawids prequential over marginal likelihood in classification tasks.

Findings

01

Marginal likelihood performs poorly in supervised model selection.

02

Dawids prequential approach yields the best predictive performance.

03

Empirical results are based on numerous publicly available datasets.

Abstract

Given a set of possible models (e.g., Bayesian network structures) and a data sample, in the unsupervised model selection problem the task is to choose the most accurate model with respect to the domain joint probability distribution. In contrast to this, in supervised model selection it is a priori known that the chosen model will be used in the future for prediction tasks involving more ``focused' predictive distributions. Although focused predictive distributions can be produced from the joint probability distribution by marginalization, in practice the best model in the unsupervised sense does not necessarily perform well in supervised domains. In particular, the standard marginal likelihood score is a criterion for the unsupervised task, and, although frequently used for supervised model selection also, does not perform well in such tasks. In this paper we study the performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Algorithms and Data Compression · Statistical Methods and Inference