A Comparison of Methods for OOV-word Recognition on a New Public Dataset

Rudolf A. Braun; Srikanth Madikeri; Petr Motlicek

arXiv:2107.08091·cs.CL·July 20, 2021

A Comparison of Methods for OOV-word Recognition on a New Public Dataset

Rudolf A. Braun, Srikanth Madikeri, Petr Motlicek

PDF

2 Repos

TL;DR

This paper introduces a new dataset, evaluation tools, and methods for improving out-of-vocabulary word recognition in speech recognition systems, demonstrating significant performance gains.

Contribution

It presents a new dataset and evaluation framework, compares subword models with traditional methods, and proposes a novel approach to enhance OOV-word recognition in hybrid ASR systems.

Findings

01

Subword models outperform traditional models in OOV recognition

02

Incorporating OOV-word information improves recognition accuracy

03

Proposed modifications significantly increase OOV-word recognition rates

Abstract

A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a new tool for calculating relevant performance metrics. We then evaluate, within the context of a hybrid ASR system, how much better subword models are at recognizing OOVs, and how much benefit one can get from incorporating OOV-word information into an existing system by modifying WFSTs. Additionally, we propose a new method for modifying a subword-based language model so as to better recognize OOV-words. We showcase very large improvements in OOV-word recognition and make both the data and code…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.