Innovative Bert-based Reranking Language Models for Speech Recognition
Shih-Hsuan Chiu, Berlin Chen

TL;DR
This paper introduces BERT-based models for reranking speech recognition hypotheses, leveraging contextual understanding and global topic information to improve accuracy, demonstrated through experiments on the AMI corpus.
Contribution
It presents a novel BERT-based reranking approach (PBERT and TPBERT) for speech recognition, incorporating global topic information for enhanced hypothesis selection.
Findings
PBERT outperforms traditional RNN-based reranking.
TPBERT effectively utilizes global topic information.
Methods show significant WER reduction on AMI corpus.
Abstract
More recently, Bidirectional Encoder Representations from Transformers (BERT) was proposed and has achieved impressive success on many natural language processing (NLP) tasks such as question answering and language understanding, due mainly to its effective pre-training then fine-tuning paradigm as well as strong local contextual modeling ability. In view of the above, this paper presents a novel instantiation of the BERT-based contextualized language models (LMs) for use in reranking of N-best hypotheses produced by automatic speech recognition (ASR). To this end, we frame N-best hypothesis reranking with BERT as a prediction problem, which aims to predict the oracle hypothesis that has the lowest word error rate (WER) given the N-best hypotheses (denoted by PBERT). In particular, we also explore to capitalize on task-specific global topic information in an unsupervised manner to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Weight Decay · Layer Normalization · Linear Warmup With Linear Decay · Attention Dropout · WordPiece · Residual Connection
