Towards Quantifiable Dialogue Coherence Evaluation
Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin, Xiaodan Liang

TL;DR
This paper introduces QuantiDCE, a novel framework for quantifiable dialogue coherence evaluation that aligns better with human ratings by using multi-level ranking and knowledge distillation with limited data.
Contribution
The paper proposes a two-stage training framework, including multi-level ranking and knowledge distillation, to produce a dialogue coherence metric that reflects human rating standards.
Findings
QuantiDCE achieves higher correlation with human judgments than existing metrics.
The framework effectively learns from limited human-annotated data.
The KD regularization enhances generalizability across different datasets.
Abstract
Automatic dialogue coherence evaluation has attracted increasing attention and is crucial for developing promising dialogue systems. However, existing metrics have two major limitations: (a) they are mostly trained in a simplified two-level setting (coherent vs. incoherent), while humans give Likert-type multi-level coherence scores, dubbed as "quantifiable"; (b) their predicted coherence scores cannot align with the actual human rating standards due to the absence of human guidance during training. To address these limitations, we propose Quantifiable Dialogue Coherence Evaluation (QuantiDCE), a novel framework aiming to train a quantifiable dialogue coherence metric that can reflect the actual human rating standards. Specifically, QuantiDCE includes two training stages, Multi-Level Ranking (MLR) pre-training and Knowledge Distillation (KD) fine-tuning. During MLR pre-training, a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsKnowledge Distillation
