Correcting Automated and Manual Speech Transcription Errors using Warped   Language Models

Mahdi Namazifar; John Malik; Li Erran Li; Gokhan Tur; Dilek Hakkani; T\"ur

arXiv:2103.14580·cs.CL·March 29, 2021

Correcting Automated and Manual Speech Transcription Errors using Warped Language Models

Mahdi Namazifar, John Malik, Li Erran Li, Gokhan Tur, Dilek Hakkani, T\"ur

PDF

Open Access

TL;DR

This paper introduces a novel approach using warped language models to effectively correct errors in automatic and manual speech transcriptions, significantly reducing word error rates.

Contribution

The work presents a new method leveraging warped language models' robustness to transcription noise for improved speech transcription correction.

Findings

01

Achieves up to 10% reduction in word error rates

02

Effective on both automatic and manual transcriptions

03

Demonstrates robustness to different types of transcription errors

Abstract

Masked language models have revolutionized natural language processing systems in the past few years. A recently introduced generalization of masked language models called warped language models are trained to be more robust to the types of errors that appear in automatic or manual transcriptions of spoken language by exposing the language model to the same types of errors during training. In this work we propose a novel approach that takes advantage of the robustness of warped language models to transcription noise for correcting transcriptions of spoken language. We show that our proposed approach is able to achieve up to 10% reduction in word error rates of both automatic and manual transcriptions of spoken language.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling