Fundamental limits for learning hidden Markov model parameters
Kweku Abraham, Zacharie Naulet, Elisabeth Gassiat

TL;DR
This paper investigates the fundamental boundaries between learnable and unlearnable hidden Markov models, providing theoretical bounds and thresholds for parameter identifiability.
Contribution
It establishes nonasymptotic minimax bounds for two-state HMMs and characterizes the thresholds where parameters become learnable.
Findings
Identifies conditions for full parameter identifiability in HMMs.
Provides minimax bounds showing learnability thresholds.
Offers bounds on the relative entropy rate near unlearnable regions.
Abstract
We study the frontier between learnable and unlearnable hidden Markov models (HMMs). HMMs are flexible tools for clustering dependent data coming from unknown populations. The model parameters are known to be fully identifiable (up to label-switching) without any modeling assumption on the distributions of the populations as soon as the clusters are distinct and the hidden chain is ergodic with a full rank transition matrix. In the limit as any one of these conditions fails, it becomes impossible in general to identify parameters. For a chain with two hidden states we prove nonasymptotic minimax upper and lower bounds, matching up to constants, which exhibit thresholds at which the parameters become learnable. We also provide an upper bound on the relative entropy rate for parameters in a neighbourhood of the unlearnable region which may have interest in itself.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
