GMM-Free Flat Start Sequence-Discriminative DNN Training

G\'abor Gosztolya; Tam\'as Gr\'osz; L\'aszl\'o T\'oth

arXiv:1610.03256·cs.CL·October 18, 2016

GMM-Free Flat Start Sequence-Discriminative DNN Training

G\'abor Gosztolya, Tam\'as Gr\'osz, L\'aszl\'o T\'oth

PDF

TL;DR

This paper introduces a GMM-free, sequence-discriminative flat start training method for DNN-based HMMs that is faster and yields slightly better word error rates compared to traditional iterative approaches.

Contribution

It proposes a novel GMM-free flat start training approach using sequence-discriminative criteria and KL-divergence-based state clustering, improving speed and accuracy.

Findings

01

Faster training than iterative realignment methods

02

Slightly improved word error rates

03

Effective GMM-free training process

Abstract

Recently, attempts have been made to remove Gaussian mixture models (GMM) from the training process of deep neural network-based hidden Markov models (HMM/DNN). For the GMM-free training of a HMM/DNN hybrid we have to solve two problems, namely the initial alignment of the frame-level state labels and the creation of context-dependent states. Although flat-start training via iteratively realigning and retraining the DNN using a frame-level error function is viable, it is quite cumbersome. Here, we propose to use a sequence-discriminative training criterion for flat start. While sequence-discriminative training is routinely applied only in the final phase of model training, we show that with proper caution it is also suitable for getting an alignment of context-independent DNN models. For the construction of tied states we apply a recently proposed KL-divergence-based state clustering…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.