ChatGPT4.o Geriatrics Knowledge Competency and Its Evaluation by Geriatricians

Iriana Hammel; Natasha Resendes; Dominique Tosi; Gokhan Demir; Huai Cheng

PMC · DOI:10.1093/geroni/igaf122.1182·December 31, 2025

ChatGPT4.o Geriatrics Knowledge Competency and Its Evaluation by Geriatricians

Iriana Hammel, Natasha Resendes, Dominique Tosi, Gokhan Demir, Huai Cheng

PDF

Open Access

TL;DR

ChatGPT4.o performed better than medical trainees on geriatric knowledge tests and received high ratings from geriatricians.

Contribution

This study evaluates ChatGPT4.o's geriatric knowledge using validated tests and compares it to trainees and geriatrician ratings.

Findings

01

ChatGPT4.o scored 18 on geriatric knowledge tests, outperforming all trainee groups.

02

Six geriatricians rated ChatGPT4.o's performance at 4.2 on a 5-point scale.

03

ChatGPT4.o's performance was deemed competent in geriatric knowledge.

Abstract

ChatGPT has passed USMLE and other medical knowledge exams, demonstrating competence in the medical field. It is less studied in Geriatric Medicine specifically. This study aimed to evaluate the geriatric competency of ChatGPT4.o by examining its performance on the validated UCLA Geriatrics knowledge tests, comparing its performance with that of trainees and exploring whether geriatricians agree with ChatGPT 4.o’s performance. 18 UCLA Geriatrics knowledge questions were answered. “Correct answer” was graded as 1, “incorrect answer” as -1 and “don’t know” as 0. The total score was between -18 to + 18. Test scores were calculated to compare ChatGPT and trainees (medical students, internal medicine residents and Geriatric medicine fellows) from previously published studies. ChatGPT4.o responses were evaluated by participants and graded on a Likert scale of 1-5 (1=strongly disagree,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Clinical Reasoning and Diagnostic Skills · Digital Mental Health Interventions