Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
Yerbolat Khassanov, Eng Siong Chng

TL;DR
This paper introduces an efficient method to expand the vocabulary of pretrained RNN language models in ASR systems without retraining, by augmenting embedding layers with OOS words derived from ASR outputs and linguistic knowledge.
Contribution
It presents a novel approach to enlarge RNNLM vocabulary efficiently by expanding embedding matrices and leveraging unsupervised extraction of out-of-shortlist words from ASR outputs.
Findings
Effective vocabulary expansion without retraining.
Maintains RNNLM performance with OOS words.
Unsupervised extraction of OOS words from ASR outputs.
Abstract
In automatic speech recognition (ASR) systems, recurrent neural network language models (RNNLM) are used to rescore a word lattice or N-best hypotheses list. Due to the expensive training, the RNNLM's vocabulary set accommodates only small shortlist of most frequent words. This leads to suboptimal performance if an input speech contains many out-of-shortlist (OOS) words. An effective solution is to increase the shortlist size and retrain the entire network which is highly inefficient. Therefore, we propose an efficient method to expand the shortlist set of a pretrained RNNLM without incurring expensive retraining and using additional training data. Our method exploits the structure of RNNLM which can be decoupled into three parts: input projection layer, middle layers, and output projection layer. Specifically, our method expands the word embedding matrices in projection layers and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
