A comparison between initialization strategies for the infinite hidden Markov model
Federico P. Cortese, Luca Rossini

TL;DR
This paper evaluates various initialization strategies for infinite hidden Markov models, finding that distance-based clustering methods outperform traditional model-based and uniform approaches in both simulated and real data scenarios.
Contribution
It systematically compares initialization methods for infinite HMMs, highlighting the superior performance of distance-based clustering strategies over commonly used alternatives.
Findings
Distance-based clustering outperforms other initialization methods.
Uniform initialization, commonly used, performs poorly.
Results are consistent across simulated and real datasets.
Abstract
Infinite hidden Markov models provide a flexible framework for modelling time series with structural changes and complex dynamics, without requiring the number of latent states to be specified in advance. This flexibility is achieved through the hierarchical Dirichlet process prior, while efficient Bayesian inference is enabled by the beam sampler, which combines dynamic programming with slice sampling to truncate the infinite state space adaptively. Despite extensive methodological developments, the role of initialization in this framework has received limited attention. This study addresses this gap by systematically evaluating initialization strategies commonly used for finite hidden Markov models and assessing their suitability in the infinite setting. Results from both simulated and real datasets show that distance-based clustering initializations consistently outperform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference · Time Series Analysis and Forecasting
