A Chat About Boring Problems: Studying GPT-based text normalization

Yang Zhang; Travis M. Bartley; Mariana Graterol-Fuenmayor; Vitaly Lavrukhin; Evelina Bakhturina; Boris Ginsburg

arXiv:2309.13426·cs.CL·January 18, 2024

TL;DR

This paper shows that large language models such as GPT-3.5 and GPT-4 can perform text normalization effectively in few-shot settings. With self-consistency reasoning and linguistically informed prompting, they reach error rates around 40% lower than top normalization systems.

Contribution

It combines self-consistency reasoning with linguistically informed prompts for LLM-based text normalization, and develops a new error taxonomy to analyze model performance.

Findings

01. LLMs achieve error rates around 40% lower than top normalization systems.

02. Self-consistency reasoning improves normalization accuracy.

03. A new taxonomy reveals strengths and weaknesses of GPT-based normalization.

Abstract

Text normalization - the conversion of text from written to spoken form - is traditionally assumed to be an ill-formed task for language models. In this work, we argue otherwise. We empirically show the capacity of Large Language Models (LLMs) for text normalization in few-shot scenarios. Combining self-consistency reasoning with linguistically informed prompt engineering, we find that LLM-based text normalization achieves error rates around 40% lower than top normalization systems. Further, upon error analysis, we note key limitations in the conventional design of text normalization tasks. We create a new taxonomy of text normalization errors and apply it to results from GPT-3.5-Turbo and GPT-4.0. Through this new framework, we can identify strengths and weaknesses of GPT-based text normalization (TN), opening opportunities for future work.
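
The abstract does not spell out how self-consistency is applied, so the following is only a minimal sketch under stated assumptions: several candidate normalizations are sampled for the same few-shot prompt and the most frequent one is kept (majority voting, the common form of self-consistency). The names `build_prompt`, `self_consistent_normalize`, and `sample_fn`, and the few-shot examples, are hypothetical and not taken from the paper; `sample_fn` stands in for any call that returns one sampled LLM completion for a prompt.

```python
from collections import Counter
from typing import Callable, List, Tuple

# Hypothetical few-shot examples (written form, spoken form); not from the paper.
FEW_SHOT: List[Tuple[str, str]] = [
    ("$5", "five dollars"),
    ("3 km", "three kilometers"),
]


def build_prompt(text: str, examples: List[Tuple[str, str]] = FEW_SHOT) -> str:
    """Assemble a simple few-shot prompt ending with the text to normalize."""
    lines = ["Convert the written text to its spoken form."]
    for written, spoken in examples:
        lines.append(f"Written: {written}\nSpoken: {spoken}")
    lines.append(f"Written: {text}\nSpoken:")
    return "\n\n".join(lines)


def self_consistent_normalize(
    text: str,
    sample_fn: Callable[[str], str],
    n_samples: int = 5,
) -> str:
    """Sample several completions and return the most frequent one.

    sample_fn is an assumed interface: it takes a prompt and returns one
    sampled completion (for example, an LLM call with temperature > 0).
    """
    prompt = build_prompt(text)
    candidates = [sample_fn(prompt).strip() for _ in range(n_samples)]
    # Majority vote; ties go to the candidate that was sampled first.
    return Counter(candidates).most_common(1)[0][0]
```

Passing the sampler in as a function keeps the voting logic independent of any particular model or client library, so the same routine can be exercised with a stub during testing.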

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Multi-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Cosine Annealing · Position-Wise Feed-Forward Layer · Residual Connection · Transformer