The Efficacy of Conversational Artificial Intelligence in Rectifying the Theory of Mind and Autonomy Biases: Comparative Analysis
Marcin Rz\k{a}deczka, Anna Sterna, Julia Stoli\'nska, Paulina, Kaczy\'nska, Marcin Moskalewicz

TL;DR
This study compares conversational AI models in their ability to identify and correct cognitive biases and recognize affect, revealing that general-purpose models outperform specialized therapeutic chatbots, with implications for mental health applications.
Contribution
It provides a comparative analysis of therapeutic versus general-purpose AI chatbots in addressing cognitive biases and affect recognition, highlighting areas for improvement.
Findings
General-purpose chatbots outperform therapeutic ones in bias rectification
GPT-4 achieves the highest scores across biases
Therapeutic chatbots need refinement for better efficacy
Abstract
Background: The increasing deployment of Conversational Artificial Intelligence (CAI) in mental health interventions necessitates an evaluation of their efficacy in rectifying cognitive biases and recognizing affect in human-AI interactions. These biases, including theory of mind and autonomy biases, can exacerbate mental health conditions such as depression and anxiety. Objective: This study aimed to assess the effectiveness of therapeutic chatbots (Wysa, Youper) versus general-purpose language models (GPT-3.5, GPT-4, Gemini Pro) in identifying and rectifying cognitive biases and recognizing affect in user interactions. Methods: The study employed virtual case scenarios simulating typical user-bot interactions. Cognitive biases assessed included theory of mind biases (anthropomorphism, overtrust, attribution) and autonomy biases (illusion of control, fundamental attribution error,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
