Guidelines vs. generative AI in CKD patient education: the role of prompt engineering and expert blinded evaluation

Lutfullah Zahit Koc; Sevgi Gulsen Koc; Ayca Inci; Osman Cagin Buldukoglu; Gokhan Koker; Edgar V. Lerma

PMC · DOI:10.1186/s12882-026-04814-3·February 20, 2026

Open Access

TL;DR

This study finds that AI models can produce CKD patient education content that blinded expert reviewers rate higher than guideline-based responses, and that structured prompts improve the readability of that content.

Contribution

The study demonstrates the effectiveness of prompt engineering in improving the readability and accuracy of AI-generated CKD education content.

Findings

01. Both AI models outperformed guideline responses in all CLEAR Tool domains (p < 0.001), with ChatGPT-4o mini achieving the highest median score (21.0, vs. 17.0 for Gemini and 13.0 for guidelines).

02. Structured prompts significantly improved the readability of AI responses, lowering the required reading level to around 7th grade.

03. Prompt engineering can make AI-generated content more usable for populations with limited health literacy.
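The reading-grade figure in the second finding can be illustrated with a short sketch. The excerpt here does not say which readability formula the authors used, so the Flesch-Kincaid grade level below is an assumption, as are the function names and the rough vowel-group syllable counter. Dedicated readability tools count syllables more carefully.

```python
import re


def count_syllables(word: str) -> int:
    """Rough estimate: one syllable per run of vowels (hypothetical helper)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid grade level; assumed metric, not confirmed by the paper."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59


# Illustrative sentences written for this sketch, not taken from the study.
plain = "Your kidneys clean your blood. Drink water each day."
technical = "Glomerular filtration deteriorates progressively."

print(f"plain: {flesch_kincaid_grade(plain):.1f}")
print(f"technical: {flesch_kincaid_grade(technical):.1f}")
```

Shorter sentences and shorter words both lower the grade, which is the kind of change a structured prompt asking for plain language would be expected to produce.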

Abstract

This study aimed to evaluate the accuracy, content quality, and readability of patient education responses about chronic kidney disease (CKD) generated by large language models (ChatGPT-4o mini and Gemini) compared with guideline-based responses. Fifteen frequently asked CKD-related questions were selected using global Google Trends data and posed to both AI models and guideline-based sources. Responses were anonymized and evaluated by four independent nephrology professors using the CLEAR Tool, which assesses completeness, appropriateness, evidence basis, and clarity. Both AI models significantly outperformed guideline responses across all CLEAR Tool domains (p < 0.001), with ChatGPT-4o mini achieving the highest median score (21.0 [IQR: 5.0] vs. Gemini: 17.0 [IQR: 5.0], Guideline: 13.0 [IQR: 2.0]). Initial readability analysis showed that guideline responses were easier to comprehend…
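The "median [IQR]" summaries reported above can be computed as in the following minimal sketch. The scores are invented for illustration and are not the study's data. The group labels and the `median_iqr` helper are hypothetical, and the inclusive quartile method is an assumption, since the abstract does not state which quartile convention was used.

```python
from statistics import median, quantiles


def median_iqr(scores):
    """Return (median, interquartile range) for a list of total scores."""
    # quantiles(..., n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = quantiles(scores, n=4, method="inclusive")
    return median(scores), q3 - q1


# Made-up rater totals for two hypothetical response groups.
hypothetical_scores = {
    "group_a": [10, 12, 13, 14, 15],
    "group_b": [16, 18, 20, 22, 24],
}

for group, scores in hypothetical_scores.items():
    med, iqr = median_iqr(scores)
    print(f"{group}: {med:.1f} [IQR: {iqr:.1f}]")
```

Median and IQR are the usual summary for ordinal rating totals like these because they do not assume the scores are normally distributed.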

Linked entities

Species (1)

Homo sapiens (human)

Diseases (2)

chronic kidney disease · CKD

Taxonomy

Topics: Artificial Intelligence in Healthcare and Education · Health Literacy and Information Accessibility · Social Media in Health Education