The Efficacy of Conversational Artificial Intelligence in Rectifying the   Theory of Mind and Autonomy Biases: Comparative Analysis

Marcin Rz\k{a}deczka; Anna Sterna; Julia Stoli\'nska; Paulina; Kaczy\'nska; Marcin Moskalewicz

arXiv:2406.13813·cs.CY·February 11, 2025

The Efficacy of Conversational Artificial Intelligence in Rectifying the Theory of Mind and Autonomy Biases: Comparative Analysis

Marcin Rz\k{a}deczka, Anna Sterna, Julia Stoli\'nska, Paulina, Kaczy\'nska, Marcin Moskalewicz

PDF

TL;DR

This study compares conversational AI models in their ability to identify and correct cognitive biases and recognize affect, revealing that general-purpose models outperform specialized therapeutic chatbots, with implications for mental health applications.

Contribution

It provides a comparative analysis of therapeutic versus general-purpose AI chatbots in addressing cognitive biases and affect recognition, highlighting areas for improvement.

Findings

01

General-purpose chatbots outperform therapeutic ones in bias rectification

02

GPT-4 achieves the highest scores across biases

03

Therapeutic chatbots need refinement for better efficacy

Abstract

Background: The increasing deployment of Conversational Artificial Intelligence (CAI) in mental health interventions necessitates an evaluation of their efficacy in rectifying cognitive biases and recognizing affect in human-AI interactions. These biases, including theory of mind and autonomy biases, can exacerbate mental health conditions such as depression and anxiety. Objective: This study aimed to assess the effectiveness of therapeutic chatbots (Wysa, Youper) versus general-purpose language models (GPT-3.5, GPT-4, Gemini Pro) in identifying and rectifying cognitive biases and recognizing affect in user interactions. Methods: The study employed virtual case scenarios simulating typical user-bot interactions. Cognitive biases assessed included theory of mind biases (anthropomorphism, overtrust, attribution) and autonomy biases (illusion of control, fundamental attribution error,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.