Spectral dimensionality reduction for HMMs

Dean P. Foster; Jordan Rodu; Lyle H. Ungar

arXiv:1203.6130·stat.ML·March 29, 2012·20 cites

Spectral dimensionality reduction for HMMs

Dean P. Foster, Jordan Rodu, Lyle H. Ungar

PDF

Open Access

TL;DR

This paper introduces a spectral method for Hidden Markov Models that reduces parameter estimation complexity and sample requirements, independent of observation vocabulary size, with provable accuracy bounds based on observable data.

Contribution

A novel spectral approach that minimizes parameter count and sample complexity for HMMs, with data-driven accuracy guarantees.

Findings

01

Reduces number of parameters needed for HMM approximation

02

Sample complexity independent of observation vocabulary size

03

Provides bounds on probability estimate accuracy

Abstract

Hidden Markov Models (HMMs) can be accurately approximated using co-occurrence frequencies of pairs and triples of observations by using a fast spectral method in contrast to the usual slow methods like EM or Gibbs sampling. We provide a new spectral method which significantly reduces the number of model parameters that need to be estimated, and generates a sample complexity that does not depend on the size of the observation vocabulary. We present an elementary proof giving bounds on the relative accuracy of probability estimates from our model. (Correlaries show our bounds can be weakened to provide either L1 bounds or KL bounds which provide easier direct comparisons to previous work.) Our theorem uses conditions that are checkable from the data, instead of putting conditions on the unobservable Markov transition matrix.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Blind Source Separation Techniques · Markov Chains and Monte Carlo Methods