TL;DR
This study evaluates various abuse response strategies in conversational agents through large-scale crowd-sourcing, revealing that polite refusal is most effective and that demographic factors influence user perception.
Contribution
It provides a comprehensive comparison of abuse response strategies, highlighting the effectiveness of rule-based approaches over data-driven models in user perception.
Findings
Polite refusal is rated highly across users.
Demographic factors influence response appropriateness.
Data-driven models lag behind rule-based systems.
Abstract
How should conversational agents respond to verbal abuse through the user? To answer this question, we conduct a large-scale crowd-sourced evaluation of abuse response strategies employed by current state-of-the-art systems. Our results show that some strategies, such as "polite refusal" score highly across the board, while for other strategies demographic factors, such as age, as well as the severity of the preceding abuse influence the user's perception of which response is appropriate. In addition, we find that most data-driven models lag behind rule-based or commercial systems in terms of their perceived appropriateness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
