Best-first Model Merging for Hidden Markov Model Induction
Andreas Stolcke (ICSI, Berkeley, CA), Stephen M. Omohundro (ICSI,, Berkeley, CA)

TL;DR
This paper introduces a best-first model merging technique for inducing Hidden Markov Models from data, which improves robustness and accuracy over traditional methods, especially with limited training data.
Contribution
The paper presents a novel heuristic merging algorithm for HMM induction that uses Bayesian criteria and demonstrates its effectiveness across multiple speech-related applications.
Findings
Merging algorithm outperforms Baum-Welch in small data scenarios
Produces more compact and accurate speech models
Enhances speech recognition performance with multi-pronunciation models
Abstract
This report describes a new technique for inducing the structure of Hidden Markov Models from data which is based on the general `model merging' strategy (Omohundro 1992). The process begins with a maximum likelihood HMM that directly encodes the training data. Successively more general models are produced by merging HMM states. A Bayesian posterior probability criterion is used to determine which states to merge and when to stop generalizing. The procedure may be considered a heuristic search for the HMM structure with the highest posterior probability. We discuss a variety of possible priors for HMMs, as well as a number of approximations which improve the computational efficiency of the algorithm. We studied three applications to evaluate the procedure. The first compares the merging algorithm with the standard Baum-Welch approach in inducing simple finite-state languages from small,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Topic Modeling
