Conversational Feedback in Scripted versus Spontaneous Dialogues: A   Comparative Analysis

Ildik\'o Pil\'an; Laurent Pr\'evot; Hendrik Buschmeier; Pierre Lison

arXiv:2309.15656·cs.CL·October 4, 2024·1 cites

Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis

Ildik\'o Pil\'an, Laurent Pr\'evot, Hendrik Buschmeier, Pierre Lison

PDF

Open Access 1 Repo

TL;DR

This study compares communicative feedback in scripted subtitles and spontaneous dialogues across multiple languages, revealing significant differences in feedback frequency and types, with implications for training conversational NLP models.

Contribution

The paper provides a comprehensive quantitative analysis of feedback phenomena in scripted versus spontaneous dialogues, highlighting key linguistic differences across languages and genres.

Findings

01

Feedback is less frequent in subtitles than in spontaneous dialogues.

02

Subtitles contain a higher proportion of negative feedback.

03

LLM-generated dialogues resemble scripted dialogues more than spontaneous ones.

Abstract

Scripted dialogues such as movie and TV subtitles constitute a widespread source of training data for conversational NLP models. However, there are notable linguistic differences between these dialogues and spontaneous interactions, especially regarding the occurrence of communicative feedback such as backchannels, acknowledgments, or clarification requests. This paper presents a quantitative analysis of such feedback phenomena in both subtitles and spontaneous conversations. Based on conversational data spanning eight languages and multiple genres, we extract lexical statistics, classifications from a dialogue act tagger, expert annotations and labels derived from a fine-tuned Large Language Model (LLM). Our main empirical findings are that (1) communicative feedback is markedly less frequent in subtitles than in spontaneous dialogues and (2) subtitles contain a higher proportion of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

norskregnesentral/conv_feedback
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification