Breakdowns in Conversational AI: Interactional Failures in Emotionally and Ethically Sensitive Contexts
Jiawen Deng, Wentao Zhang, Ziyun Jiao, Fuji Ren

TL;DR
This paper investigates interactional failures in conversational AI during emotionally and ethically sensitive interactions, highlighting common breakdowns and proposing a taxonomy for diagnosis and improvement.
Contribution
It introduces a persona-conditioned user simulator to stress-test models and identifies key failure patterns affecting dialogue quality in sensitive contexts.
Findings
Mainstream models show recurrent affective and ethical failures.
Failures intensify with escalating emotional trajectories.
A taxonomy of breakdown patterns guides future improvements.
Abstract
Conversational AI is increasingly deployed in emotionally charged and ethically sensitive interactions. Previous research has primarily concentrated on emotional benchmarks or static safety checks, overlooking how alignment unfolds in evolving conversation. We explore the research question: what breakdowns arise when conversational agents confront emotionally and ethically sensitive behaviors, and how do these affect dialogue quality? To stress-test chatbot performance, we develop a persona-conditioned user simulator capable of engaging in multi-turn dialogue with psychological personas and staged emotional pacing. Our analysis reveals that mainstream models exhibit recurrent breakdowns that intensify as emotional trajectories escalate. We identify several common failure patterns, including affective misalignments, ethical guidance failures, and cross-dimensional trade-offs where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
