N-gram Boosting: Improving Contextual Biasing with Normalized N-gram   Targets

Wang Yau Li; Shreekantha Nadig; Karol Chang; Zafarullah Mahmood,; Riqiang Wang; Simon Vandieken; Jonas Robertson; Fred Mailhot

arXiv:2308.02092·cs.CL·August 7, 2023

N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets

Wang Yau Li, Shreekantha Nadig, Karol Chang, Zafarullah Mahmood,, Riqiang Wang, Simon Vandieken, Jonas Robertson, Fred Mailhot

PDF

Open Access

TL;DR

This paper introduces N-gram Boosting, a method that enhances speech-to-text accuracy for proper names and technical terms by boosting normalized n-gram targets, significantly improving recognition rates especially for complex tokens.

Contribution

It proposes a novel two-step boosting mechanism for normalized n-grams, addressing missing hits and over-boosting issues in keyword recognition tasks.

Findings

01

26% relative improvement on in-domain dataset

02

2% improvement on LibriSpeech

03

Effective for non-standard and non-alphabetic targets

Abstract

Accurate transcription of proper names and technical terms is particularly important in speech-to-text applications for business conversations. These words, which are essential to understanding the conversation, are often rare and therefore likely to be under-represented in text and audio training data, creating a significant challenge in this domain. We present a two-step keyword boosting mechanism that successfully works on normalized unigrams and n-grams rather than just single tokens, which eliminates missing hits issues with boosting raw targets. In addition, we show how adjusting the boosting weight logic avoids over-boosting multi-token keywords. This improves our keyword recognition rate by 26% relative on our proprietary in-domain dataset and 2% on LibriSpeech. This method is particularly useful on targets that involve non-alphabetic characters or have non-standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques