RescoreBERT: Discriminative Speech Recognition Rescoring with BERT
Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya, Rastrow, Andreas Stolcke, Ivan Bulyko

TL;DR
RescoreBERT introduces a novel discriminative training approach for BERT-based speech recognition rescoring, significantly reducing word error rates and latency in ASR systems by integrating a minimum WER loss with pretrained language models.
Contribution
The paper presents a new method for training BERT with a discriminative MWER loss for ASR rescoring, combining MLM pretraining with discriminative fine-tuning.
Findings
Reduces WER by 6.6%/3.4% on LibriSpeech test sets.
Decreases latency and WER by 3-8% on conversational datasets.
Outperforms baseline BERT rescoring without discriminative training.
Abstract
Second-pass rescoring is an important component in automatic speech recognition (ASR) systems that is used to improve the outputs from a first-pass decoder by implementing a lattice rescoring or -best re-ranking. While pretraining with a masked language model (MLM) objective has received great success in various natural language understanding (NLU) tasks, it has not gained traction as a rescoring model for ASR. Specifically, training a bidirectional model like BERT on a discriminative objective such as minimum WER (MWER) has not been explored. Here we show how to train a BERT-based rescoring model with MWER loss, to incorporate the improvements of a discriminative loss into fine-tuning of deep bidirectional pretrained models for ASR. Specifically, we propose a fusion strategy that incorporates the MLM into the discriminative training process to effectively distill knowledge from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · WordPiece · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Tanh Activation · Softmax
