ChatGPT4.o Geriatrics Knowledge Competency and Its Evaluation by Geriatricians
Iriana Hammel, Natasha Resendes, Dominique Tosi, Gokhan Demir, Huai Cheng

TL;DR
ChatGPT4.o performed better than medical trainees on geriatric knowledge tests and received high ratings from geriatricians.
Contribution
This study evaluates ChatGPT4.o's geriatric knowledge using validated tests and compares it to trainees and geriatrician ratings.
Findings
ChatGPT4.o scored 18 on the geriatric knowledge test, the maximum on its -18 to +18 scale, outperforming all trainee groups.
Six geriatricians rated ChatGPT4.o's performance at 4.2 on a 5-point scale.
ChatGPT4.o was deemed competent in geriatric knowledge.
Abstract
ChatGPT has passed the USMLE and other medical knowledge exams, demonstrating competence in the medical field. It has been studied less in Geriatric Medicine specifically. This study aimed to evaluate the geriatric competency of ChatGPT4.o by examining its performance on the validated UCLA Geriatrics knowledge tests, comparing its performance with that of trainees, and exploring whether geriatricians agree with ChatGPT4.o’s performance. Eighteen UCLA Geriatrics knowledge questions were answered. “Correct answer” was graded as 1, “incorrect answer” as -1, and “don’t know” as 0. The total score ranged from -18 to +18. Test scores were calculated to compare ChatGPT with trainees (medical students, internal medicine residents, and Geriatric Medicine fellows) from previously published studies. ChatGPT4.o responses were evaluated by participants and graded on a Likert scale of 1-5 (1=strongly disagree,…
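The scoring rule described in the abstract (+1 for a correct answer, -1 for an incorrect answer, 0 for "don't know", summed over 18 questions) can be illustrated with a minimal sketch. The function name, the response labels, and the sample answer sheets below are assumptions made for illustration; they are not the study's code or data.

```python
# Points per response type, as stated in the abstract.
# The label strings themselves are hypothetical.
POINTS = {"correct": 1, "incorrect": -1, "dont_know": 0}


def total_score(responses):
    """Sum the points over a list of response labels."""
    return sum(POINTS[r] for r in responses)


# Hypothetical answer sheets for illustration only (not data from the study).
all_correct = ["correct"] * 18
mixed = ["correct"] * 12 + ["incorrect"] * 4 + ["dont_know"] * 2

print(total_score(all_correct))  # 18, the top of the -18 to +18 range
print(total_score(mixed))        # 12 - 4 + 0 = 8
```

Because an incorrect answer costs a point while "don't know" costs nothing, this rule penalizes guessing, which is why the range extends down to -18 rather than stopping at 0.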
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education · Clinical Reasoning and Diagnostic Skills · Digital Mental Health Interventions
