Assessment of Generative AI abilities to diagnose and propose treatment in comparison with psychiatrists from Poland and Tunisia
M. Sinica, A. Malec, H. Sghaier, P. Kalkowski

TL;DR
This study compares the diagnostic and treatment proposal abilities of generative AI with psychiatrists from Poland and Tunisia.
Contribution
The study introduces a novel comparison between generative AI and human psychiatrists using a Turing test framework.
Findings
Generative AI provided valid diagnoses in most cases, with newer versions of ChatGPT performing best.
Treatment proposals from AI were less accurate than its diagnoses.
Human psychiatrists were generally favored in assessments over AI.
Abstract
The increasing popularity of generative AI systems such as GPT raises new dilemmas concerning the future of diagnosis and offers novel tools to improve psychiatrists' daily work. The aim of the study was to assess the ability of generative AI to diagnose and propose treatment in comparison with real psychiatrists, and to perform a Turing test. We examined the ability of various generative AI versions (ChatGPT/ChatGPTpro etc.) to diagnose and propose treatment, and then compared the results with those of 10 clinicians performing the same task. A group of 10 psychiatry specialists not involved in the first evaluation then assessed whether each diagnosis and treatment had been established by generative AI or by a clinician. The results showed that the generative AI systems were able to provide a valid diagnosis in most cases, with the newer and most proficient version of ChatGPT performing best. Proposed treatment…
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education
