Frontiers to the learning of nonparametric hidden Markov models
Kweku Abraham, Elisabeth Gassiat, Zacharie Naulet

TL;DR
This paper investigates the learnability of nonparametric two-state hidden Markov models, revealing a transition in estimation difficulty near the i.i.d. boundary and highlighting the impact of population smoothness differences.
Contribution
It provides the first nonasymptotic, nonparametric minimax risk analysis of two-state HMMs considering the boundary between learnable and unlearnable cases.
Findings
Identifies a phase transition in estimation rates near the i.i.d. boundary.
Shows that smoother densities can help improve estimation of less smooth densities.
Provides upper and lower bounds on the minimax risk for nonparametric HMMs.
Abstract
Hidden Markov models (HMMs) are flexible tools for clustering dependent data coming from unknown populations, allowing nonparametric modelling of the population densities. Identifiability fails when the data is in fact independent and identically distributed (i.i.d.), and we study the frontier between learnable and unlearnable two-state nonparametric HMMs. Learning the parameters of the HMM requires solving a nonlinear inverse problem whose difficulty depends not only on the smoothnesses of the populations but also on the distance to the i.i.d. boundary of the parameter set. The latter difficulty is mostly ignored in the literature in favour of assumptions precluding nearly independent data. This is the first work conducting a precise nonasymptotic, nonparametric analysis of the minimax risk taking into account all aspects of the hardness of the problem, in the case of two populations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Data Quality and Management · Human Mobility and Location-Based Analysis
