The importance of fillers for text representations of speech transcripts

Tanvi Dinkar; Pierre Colombo; Matthieu Labeau; Chlo\'e Clavel

arXiv:2009.11340·cs.CL·October 2, 2020

The importance of fillers for text representations of speech transcripts

Tanvi Dinkar, Pierre Colombo, Matthieu Labeau, Chlo\'e Clavel

PDF

TL;DR

This paper investigates the role of fillers in spoken language understanding, demonstrating that representing fillers with deep contextualized embeddings enhances modeling spoken language and improves downstream task performance.

Contribution

It introduces a method for representing fillers using deep contextualized embeddings, showing their importance in SLU tasks and downstream applications.

Findings

01

Improved performance on stance prediction task

02

Enhanced modeling of spoken language with fillers

03

Fillers' representations contribute significantly to downstream tasks

Abstract

While being an essential component of spoken language, fillers (e.g."um" or "uh") often remain overlooked in Spoken Language Understanding (SLU) tasks. We explore the possibility of representing them with deep contextualised embeddings, showing improvements on modelling spoken language and two downstream tasks - predicting a speaker's stance and expressed confidence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.