Tag and correct: high precision post-editing approach to correction of   speech recognition errors

Tomasz Zi\k{e}tkiewicz

arXiv:2406.07589·cs.CL·June 13, 2024

Tag and correct: high precision post-editing approach to correction of speech recognition errors

Tomasz Zi\k{e}tkiewicz

PDF

TL;DR

This paper introduces a neural post-editing method for speech recognition error correction that achieves high precision, is resource-efficient, and suitable for industrial deployment across various ASR systems.

Contribution

A novel neural sequence tagging approach for speech recognition correction that is resource-efficient and adaptable to any ASR system, emphasizing high precision and low training costs.

Findings

01

Comparable performance to previous methods

02

Requires significantly less training resources

03

Suitable for real-time industrial applications

Abstract

This paper presents a new approach to the problem of correcting speech recognition errors by means of post-editing. It consists of using a neural sequence tagger that learns how to correct an ASR (Automatic Speech Recognition) hypothesis word by word and a corrector module that applies corrections returned by the tagger. The proposed solution is applicable to any ASR system, regardless of its architecture, and provides high-precision control over errors being corrected. This is especially crucial in production environments, where avoiding the introduction of new mistakes by the error correction model may be more important than the net gain in overall results. The results show that the performance of the proposed error correction models is comparable with previous approaches while requiring much smaller resources to train, which makes it suitable for industrial applications, where both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.