HypR: A comprehensive study for ASR hypothesis revising with a reference corpus

Yi-Wei Wang; Ke-Han Lu; Kuan-Yu Chen

arXiv:2309.09838·cs.CL·June 14, 2024



TL;DR

This paper introduces HypR, a comprehensive dataset and benchmark for ASR hypothesis revising, facilitating fair comparison of error correction and reranking methods to improve speech recognition accuracy.

Contribution

It provides the first unified dataset with multiple corpora and hypotheses, along with implemented baseline methods, to standardize evaluation in ASR hypothesis revising research.

Findings

01. Compared various revising methods on the HypR dataset

02. Demonstrated the effectiveness of different error correction techniques

03. Established a benchmark for future ASR hypothesis revising studies

Abstract

With the development of deep learning, automatic speech recognition (ASR) has made significant progress. To further enhance ASR performance, revising the recognition results is a lightweight yet effective approach. Existing methods can be roughly classified into N-best reranking and error correction. The former aims to select the hypothesis with the lowest error rate from a set of candidates generated by the ASR system for a given input utterance. The latter focuses on detecting recognition errors in a given hypothesis and correcting them to obtain an improved result. However, we observe that these studies are hardly comparable to one another, as they are usually evaluated on different corpora, paired with different ASR models, and even trained on different datasets. Accordingly, we first concentrate on providing an ASR hypothesis revising (HypR)…
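The N-best reranking objective described in the abstract (picking the candidate with the lowest error rate) can be illustrated with a minimal sketch. This is not the paper's method or code: the function names `edit_distance`, `wer`, and `oracle_best` are hypothetical, and the selection here is an oracle that consults the reference transcript, whereas a real reranker must score candidates without access to it. The oracle is shown only because it makes the target of reranking concrete.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: edit distance divided by the reference length."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def oracle_best(reference: str, nbest: List[str]) -> str:
    """Return the candidate with the lowest WER (assumes the reference is known)."""
    return min(nbest, key=lambda hyp: wer(reference, hyp))


# Illustrative, made-up N-best list; not taken from the HypR dataset.
reference = "the cat sat on the mat"
nbest = [
    "the cat sat on a mat",
    "the cat sat on the mat",
    "a cat sit on the mat",
]
best = oracle_best(reference, nbest)
```

The oracle result gives the lower bound on error that any reranker could reach for a given N-best list; error correction models, by contrast, may produce a string that is not in the list at all.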


Code & Models

Repositories

alfred0622/hypr (Official)


Taxonomy

Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling