Neural Inverse Text Normalization

Monica Sunkara; Chaitanya Shivade; Sravan Bodapati; Katrin Kirchhoff

arXiv:2102.06380·cs.CL·February 15, 2021

Neural Inverse Text Normalization

Monica Sunkara, Chaitanya Shivade, Sravan Bodapati, Katrin Kirchhoff

PDF

TL;DR

This paper introduces a neural inverse text normalization method using transformer models combined with FST techniques, improving accuracy and scalability across multiple languages and reducing errors in speech recognition outputs.

Contribution

It presents a novel neural approach for inverse text normalization that is scalable, language-agnostic, and effectively integrated with existing FST methods for improved performance.

Findings

01

Reduces errors in ASR output across multiple languages

02

Outperforms baseline models on English, Spanish, German, and Italian datasets

03

Maintains high quality on out-of-domain data

Abstract

While there have been several contributions exploring state of the art techniques for text normalization, the problem of inverse text normalization (ITN) remains relatively unexplored. The best known approaches leverage finite state transducer (FST) based models which rely on manually curated rules and are hence not scalable. We propose an efficient and robust neural solution for ITN leveraging transformer based seq2seq models and FST-based text normalization techniques for data preparation. We show that this can be easily extended to other languages without the need for a linguistic expert to manually curate them. We then present a hybrid framework for integrating Neural ITN with an FST to overcome common recoverable errors in production environments. Our empirical evaluations show that the proposed solution minimizes incorrect perturbations (insertions, deletions and substitutions) to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence