A Crowd-based Evaluation of Abuse Response Strategies in Conversational   Agents

Amanda Cercas Curry; Verena Rieser

arXiv:1909.04387·cs.HC·September 11, 2019

A Crowd-based Evaluation of Abuse Response Strategies in Conversational Agents

Amanda Cercas Curry, Verena Rieser

PDF

1 Repo

TL;DR

This study evaluates various abuse response strategies in conversational agents through large-scale crowd-sourcing, revealing that polite refusal is most effective and that demographic factors influence user perception.

Contribution

It provides a comprehensive comparison of abuse response strategies, highlighting the effectiveness of rule-based approaches over data-driven models in user perception.

Findings

01

Polite refusal is rated highly across users.

02

Demographic factors influence response appropriateness.

03

Data-driven models lag behind rule-based systems.

Abstract

How should conversational agents respond to verbal abuse through the user? To answer this question, we conduct a large-scale crowd-sourced evaluation of abuse response strategies employed by current state-of-the-art systems. Our results show that some strategies, such as "polite refusal" score highly across the board, while for other strategies demographic factors, such as age, as well as the severity of the preceding abuse influence the user's perception of which response is appropriate. In addition, we find that most data-driven models lag behind rule-based or commercial systems in terms of their perceived appropriateness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amandacurry/metoo_corpus
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.