Boosting Punctuation Restoration with Data Generation and Reinforcement   Learning

Viet Dac Lai; Abel Salinas; Hao Tan; Trung Bui; Quan Tran; Seunghyun; Yoon; Hanieh Deilamsalehy; Franck Dernoncourt; Thien Huu Nguyen

arXiv:2307.12949·cs.CL·July 25, 2023

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun, Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach combined with large generative language models to improve punctuation restoration in ASR texts, addressing the data discrepancy issue and achieving state-of-the-art results.

Contribution

It presents a novel reinforcement learning method leveraging pre-trained generative models to enhance punctuation restoration for ASR outputs.

Findings

01

Achieved state-of-the-art performance on two benchmark datasets.

02

Effectively bridged the gap between written and ASR texts.

03

Improved punctuation accuracy in ASR transcripts.

Abstract

Punctuation restoration is an important task in automatic speech recognition (ASR) which aim to restore the syntactic structure of generated ASR texts to improve readability. While punctuated texts are abundant from written documents, the discrepancy between written punctuated texts and ASR texts limits the usability of written texts in training punctuation restoration systems for ASR texts. This paper proposes a reinforcement learning method to exploit in-topic written texts and recent advances in large pre-trained generative language models to bridge this gap. The experiments show that our method achieves state-of-the-art performance on the ASR test set on two benchmark datasets for punctuation restoration.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis