PARADISE: A Framework for Evaluating Spoken Dialogue Agents
Marilyn A. Walker, Diane J. Litman, Candace A. Kamm, Alicia Abella, (ATT Labs - Research)

TL;DR
PARADISE is a comprehensive framework designed to evaluate spoken dialogue agents by analyzing their performance across various strategies, tasks, and dialogue segments, facilitating fair comparisons and understanding of factors influencing success.
Contribution
It introduces a general, task-independent evaluation framework that decouples dialogue behaviors from task requirements and supports detailed performance analysis.
Findings
Supports comparison of dialogue strategies
Enables performance evaluation over subdialogues
Normalizes for task complexity in comparisons
Abstract
This paper presents PARADISE (PARAdigm for DIalogue System Evaluation), a general framework for evaluating spoken dialogue agents. The framework decouples task requirements from an agent's dialogue behaviors, supports comparisons among dialogue strategies, enables the calculation of performance over subdialogues and whole dialogues, specifies the relative contribution of various factors to performance, and makes it possible to compare agents performing different tasks by normalizing for task complexity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Multi-Agent Systems and Negotiation
