Towards Quantifiable Dialogue Coherence Evaluation

Zheng Ye; Liucun Lu; Lishan Huang; Liang Lin; Xiaodan Liang

arXiv:2106.00507·cs.CL·July 23, 2021

Towards Quantifiable Dialogue Coherence Evaluation

Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin, Xiaodan Liang

PDF

Open Access 1 Repo

TL;DR

This paper introduces QuantiDCE, a novel framework for quantifiable dialogue coherence evaluation that aligns better with human ratings by using multi-level ranking and knowledge distillation with limited data.

Contribution

The paper proposes a two-stage training framework, including multi-level ranking and knowledge distillation, to produce a dialogue coherence metric that reflects human rating standards.

Findings

01

QuantiDCE achieves higher correlation with human judgments than existing metrics.

02

The framework effectively learns from limited human-annotated data.

03

The KD regularization enhances generalizability across different datasets.

Abstract

Automatic dialogue coherence evaluation has attracted increasing attention and is crucial for developing promising dialogue systems. However, existing metrics have two major limitations: (a) they are mostly trained in a simplified two-level setting (coherent vs. incoherent), while humans give Likert-type multi-level coherence scores, dubbed as "quantifiable"; (b) their predicted coherence scores cannot align with the actual human rating standards due to the absence of human guidance during training. To address these limitations, we propose Quantifiable Dialogue Coherence Evaluation (QuantiDCE), a novel framework aiming to train a quantifiable dialogue coherence metric that can reflect the actual human rating standards. Specifically, QuantiDCE includes two training stages, Multi-Level Ranking (MLR) pre-training and Knowledge Distillation (KD) fine-tuning. During MLR pre-training, a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

James-Yip/QuantiDCE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsKnowledge Distillation