Evaluator for Emotionally Consistent Chatbots

Chenxiao Liu; Guanzhi Deng; Tao Ji; Difei Tang; Silai Zheng

arXiv:2112.01616 · cs.CL · December 6, 2021

TL;DR

This paper introduces an evaluator designed to assess the emotional consistency of chatbots, addressing a key gap in current evaluation methods that focus on coherence and fluency.

Contribution

It proposes a novel training approach for an evaluator specifically targeting emotional consistency in chatbot responses.

Findings

01 Evaluator effectively measures emotional consistency
02 Improves assessment accuracy over existing metrics
03 Enhances chatbot development with emotional awareness

Abstract

One challenge in evaluating current sequence- or dialogue-level chatbots, such as Empathetic Open-domain Conversation Models, is determining whether the chatbot behaves in an emotionally consistent way. Most recent work evaluates only context coherence, language fluency, response diversity, or logical self-consistency between dialogues. This work proposes training an evaluator to determine the emotional consistency of chatbots.

Code & Models

Repositories

osirislambert/chatbot-evaulator · PyTorch · Official

Taxonomy

Topics: AI in Service Interactions