WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm
Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

TL;DR
This paper introduces WER-BERT, a novel BERT-based model for automatic WER estimation that addresses class imbalance and the ordinal nature of WER, achieving state-of-the-art results on standard datasets.
Contribution
It proposes a balanced classification paradigm for e-WER, a new distance loss for ordinal data, and a BERT-based architecture tailored for WER prediction.
Findings
Achieves state-of-the-art accuracy on Librispeech and Google API datasets.
Effectively handles class imbalance in WER classification.
Demonstrates the benefit of the distance loss for ordinal WER estimation.
Abstract
Automatic Speech Recognition (ASR) systems are evaluated using Word Error Rate (WER), which is calculated by comparing the number of errors between the ground truth and the transcription of the ASR system. This calculation, however, requires manual transcription of the speech signal to obtain the ground truth. Since transcribing audio signals is a costly process, Automatic WER Evaluation (e-WER) methods have been developed to automatically predict the WER of a speech system by only relying on the transcription and the speech signal features. While WER is a continuous variable, previous works have shown that positing e-WER as a classification problem is more effective than regression. However, while converting to a classification setting, these approaches suffer from heavy class imbalance. In this paper, we propose a new balanced paradigm for e-WER in a classification setting. Within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Dropout · Softmax · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Attention Dropout · WordPiece · Residual Connection · Adam
