Revisiting the DARPA Communicator Data using Conversation Analysis
Peter Wallis

TL;DR
This paper applies conversation analysis to DARPA Communicator transcripts to identify system failures, revealing that handling mixed initiative discourse is a key challenge in human-computer dialogue systems.
Contribution
It demonstrates the use of conversation analysis to diagnose failures in dialogue systems, highlighting the importance of discourse-level understanding for improvement.
Findings
Swear words indicate points of failure in user-system interactions.
Failure to handle mixed initiative discourse causes user frustration.
Conversation analysis offers a qualitative approach to diagnosing dialogue system issues.
Abstract
The state of the art in human computer conversation leaves something to be desired and, indeed, talking to a computer can be down-right annoying. This paper describes an approach to identifying ``opportunities for improvement'' in these systems by looking for abuse in the form of swear words. The premise is that humans swear at computers as a sanction and, as such, swear words represent a point of failure where the system did not behave as it should. Having identified where things went wrong, we can work backward through the transcripts and, using conversation analysis (CA) work out how things went wrong. Conversation analysis is a qualitative methodology and can appear quite alien - indeed unscientific - to those of us from a quantitative background. The paper starts with a description of Conversation analysis in its modern form, and then goes on to apply the methodology to transcripts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
