Metrics for Evaluating Dialogue Strategies in a Spoken Language System
Morena Danieli, Elisabetta Gerbino (CSELT - Torino, Italy)

TL;DR
This paper introduces a comprehensive set of metrics, including a novel implicit recovery metric, for evaluating dialogue management strategies in spoken language systems, providing insights into their impact on dialogue quality.
Contribution
It proposes a new metric, implicit recovery, and applies it alongside established metrics to evaluate and compare dialogue repair strategies in real-time spoken language systems.
Findings
Implicit recovery effectively measures error handling capabilities.
Different dialogue repair strategies show varying performance on proposed metrics.
Metrics provide insights into dialogue quality and management effectiveness.
Abstract
In this paper, we describe a set of metrics for the evaluation of different dialogue management strategies in an implemented real-time spoken language system. The set of metrics we propose offers useful insights in evaluating how particular choices in the dialogue management can affect the overall quality of the man-machine dialogue. The evaluation makes use of established metrics: the transaction success, the contextual appropriateness of system answers, the calculation of normal and correction turns in a dialogue. We also define a new metric, the implicit recovery, which allows to measure the ability of a dialogue manager to deal with errors by different levels of analysis. We report evaluation data from several experiments, and we compare two different approaches to dialogue repair strategies using the set of metrics we argue for.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Natural Language Processing Techniques · Speech Recognition and Synthesis
