Sample Size Considerations for Bayesian Multilevel Hidden Markov Models: A Simulation Study on Multivariate Continuous Data with highly overlapping Component Distributions based on Sleep Data
Jasper Ginn, Sebastian Mildiner Moraga, Emmeke Aarts

TL;DR
This study investigates how sample size, number of occasions, and variability affect Bayesian multilevel Hidden Markov Model estimates using sleep data, providing practical guidelines for researchers in social and behavioral sciences.
Contribution
It offers the first systematic simulation-based evaluation of sample size requirements for Bayesian multilevel HMMs with multivariate continuous data.
Findings
Number of subjects significantly impacts model performance.
Number of occasions influences the accuracy of latent state transition estimates.
Data characteristics affect parameter estimation and model reliability.
Abstract
Spurred in part by the ever-growing number of sensors and web-based methods of collecting data, the use of Intensive Longitudinal Data (ILD) is becoming more common in the social and behavioural sciences. The ILD collected in this field are often hypothesised to be the result of latent states (e.g. behaviour, emotions), and the promise of ILD lies in its ability to capture the dynamics of these states as they unfold in time. In particular, by collecting data for multiple subjects, researchers can observe how such dynamics differ between subjects. The Bayesian Multilevel Hidden Markov Model (mHMM) is a relatively novel model that is suited to model the ILD of this kind while taking into account heterogeneity between subjects. While the mHMM has been applied in a variety of settings, large-scale studies that examine the required sample size for this model are lacking. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Transportation Planning and Optimization
