Three Approaches to Probability Model Selection
William B. Poland, Ross D. Shachter

TL;DR
This paper compares three methods for selecting probability models, highlighting their relationships and applicability, especially in computationally efficient and AI contexts, with a focus on penalized likelihood frameworks.
Contribution
It unifies three model selection approaches under a penalized likelihood framework and demonstrates their relationships and practical advantages in AI applications.
Findings
All three methods can be viewed as maximizing a penalized likelihood.
The second approach reduces to the first with suitable priors.
The third approach reduces to the second for a specific effectiveness ratio.
Abstract
This paper compares three approaches to the problem of selecting among probability models to fit data (1) use of statistical criteria such as Akaike's information criterion and Schwarz's "Bayesian information criterion," (2) maximization of the posterior probability of the model, and (3) maximization of an effectiveness ratio? trading off accuracy and computational cost. The unifying characteristic of the approaches is that all can be viewed as maximizing a penalized likelihood function. The second approach with suitable prior distributions has been shown to reduce to the first. This paper shows that the third approach reduces to the second for a particular form of the effectiveness ratio, and illustrates all three approaches with the problem of selecting the number of components in a mixture of Gaussian distributions. Unlike the first two approaches, the third can be used even when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models · Advanced Statistical Methods and Models
