Revisiting the DARPA Communicator Data using Conversation Analysis

Peter Wallis

arXiv:2307.06982·cs.CL·July 17, 2023

Revisiting the DARPA Communicator Data using Conversation Analysis

Peter Wallis

PDF

TL;DR

This paper applies conversation analysis to DARPA Communicator transcripts to identify system failures, revealing that handling mixed initiative discourse is a key challenge in human-computer dialogue systems.

Contribution

It demonstrates the use of conversation analysis to diagnose failures in dialogue systems, highlighting the importance of discourse-level understanding for improvement.

Findings

01

Swear words indicate points of failure in user-system interactions.

02

Failure to handle mixed initiative discourse causes user frustration.

03

Conversation analysis offers a qualitative approach to diagnosing dialogue system issues.

Abstract

The state of the art in human computer conversation leaves something to be desired and, indeed, talking to a computer can be down-right annoying. This paper describes an approach to identifying ``opportunities for improvement'' in these systems by looking for abuse in the form of swear words. The premise is that humans swear at computers as a sanction and, as such, swear words represent a point of failure where the system did not behave as it should. Having identified where things went wrong, we can work backward through the transcripts and, using conversation analysis (CA) work out how things went wrong. Conversation analysis is a qualitative methodology and can appear quite alien - indeed unscientific - to those of us from a quantitative background. The paper starts with a description of Conversation analysis in its modern form, and then goes on to apply the methodology to transcripts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.