Order selection with confidence for finite mixture models
Hien D Nguyen, Daniel Fryer, Geoffrey McLachlan

TL;DR
This paper introduces a sequential testing procedure based on the closed testing principle for confidently determining the number of components in finite mixture models, supported by finite sample tests and asymptotic consistency.
Contribution
It develops a novel sequential testing framework with finite sample tests for mixture model order selection and confidence statements, extending applicability to nested models.
Findings
Finite sample tests are consistent against fixed alternatives.
The proposed STP provides reliable confidence statements about model order.
A modified STP achieves asymptotic consistency in order selection.
Abstract
The determination of the number of mixture components (the order) of a finite mixture model has been an enduring problem in statistical inference. We prove that the closed testing principle leads to a sequential testing procedure (STP) that allows for confidence statements to be made regarding the order of a finite mixture model. We construct finite sample tests, via data splitting and data swapping, for use in the STP, and we prove that such tests are consistent against fixed alternatives. Simulation studies and real data examples are used to demonstrate the performance of the finite sample tests-based STP, yielding practical recommendations of their use as confidence estimators in combination with point estimates such as the Akaike information or Bayesian information criteria. In addition, we demonstrate that a modification of the STP yields a method that consistently selects the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Distribution Estimation and Applications
