Bayesian posterior probabilities: revisited
David A. Morrison

TL;DR
This paper reexamines the accuracy of Bayesian posterior probabilities in phylogenetics, arguing they tend to be overestimated even when models match the data, especially with longer sequences and more complex models.
Contribution
It challenges previous conclusions by showing Bayesian posterior probabilities are slightly overestimated under ideal model conditions and discusses implications for model complexity and sequence length.
Findings
Posterior probabilities are slightly too large even when models match.
Overestimation increases with longer sequence data.
Model complexity may also influence overestimation.
Abstract
Huelsenbeck and Rannala (2004, Systematic Biology 53, 904-913) presented a series of simulations in order to assess the extent to which the bayesian posterior probabilities associated with phylogenetic trees represent the standard frequentist statistical interpretation. They concluded that when the analysis model matches the generating model then the bayesian posterior probabilities are correct, but that the probabilities are much too large when the model is under-specified and slightly too small when the model is over-specified. Here, I take issue with the first conclusion, and instead contend that their simulation data show that the posterior probabilities are still slightly too large even when the models match. Furthermore, I suggest that the data show that the degree of this over-estimation increases as the sequence length increases, and that it might increase as model complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
