WERd: Using Social Text Spelling Variants for Evaluating Dialectal   Speech Recognition

Ahmed Ali; Preslav Nakov; Peter Bell; Steve Renals

arXiv:1709.07484·cs.CL·September 25, 2017

WERd: Using Social Text Spelling Variants for Evaluating Dialectal Speech Recognition

Ahmed Ali, Preslav Nakov, Peter Bell, Steve Renals

PDF

TL;DR

This paper introduces WERd, a new evaluation metric for dialectal speech recognition that accounts for non-standard orthography by using social media spelling variants, improving upon traditional WER.

Contribution

The paper proposes WERd, a dialect-specific evaluation metric based on TERp, leveraging social media spelling variants to better assess dialectal ASR systems.

Findings

01

WERd outperforms traditional WER in dialectal ASR evaluation

02

Social media spelling variants improve evaluation accuracy

03

Manual analysis confirms WERd's effectiveness

Abstract

We study the problem of evaluating automatic speech recognition (ASR) systems that target dialectal speech input. A major challenge in this case is that the orthography of dialects is typically not standardized. From an ASR evaluation perspective, this means that there is no clear gold standard for the expected output, and several possible outputs could be considered correct according to different human annotators, which makes standard word error rate (WER) inadequate as an evaluation metric. Such a situation is typical for machine translation (MT), and thus we borrow ideas from an MT evaluation metric, namely TERp, an extension of translation error rate which is closely-related to WER. In particular, in the process of comparing a hypothesis to a reference, we make use of spelling variants for words and phrases, which we mine from Twitter in an unsupervised fashion. Our experiments with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.