PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs,, Haizhou Li

TL;DR
The paper introduces PoE, a multi-domain dialogue evaluation model using a shared transformer encoder and domain-specific adapters, achieving state-of-the-art results and strong zero-shot generalization across diverse dialogue datasets.
Contribution
It proposes a novel Panel of Experts architecture with domain-specific adapters for improved multi-domain dialogue assessment.
Findings
Achieves state-of-the-art correlation scores across 16 datasets.
Demonstrates superior zero-shot generalization compared to existing metrics.
Easily adapts to new domains with few-shot learning.
Abstract
Chatbots are expected to be knowledgeable across multiple domains, e.g. for daily chit-chat, exchange of information, and grounding in emotional situations. To effectively measure the quality of such conversational agents, a model-based automatic dialogue evaluation metric (ADEM) is expected to perform well across multiple domains. Despite significant progress, an ADEM that works well in one domain does not necessarily generalize to another. This calls for a dedicated network architecture for domain generalization. To tackle the multi-domain dialogue evaluation task, we propose a Panel of Experts (PoE), a multitask network that consists of a shared transformer encoder and a collection of lightweight adapters. The shared encoder captures the general knowledge of dialogues across domains, while each adapter specializes in one specific domain and serves as a domain expert. To validate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Speech Recognition and Synthesis
MethodsAdapter
