Challenges of using generative AI for patient education in chronic heart failure: an evaluation of content quality, readability, and actionability in cross-platform LLM-generated texts

Zhiqiang Wang; Xiaoya Li; Chao Ma; Zhiwen Zhang

PMC · DOI:10.3389/fpubh.2026.1801829·March 5, 2026

Challenges of using generative AI for patient education in chronic heart failure: an evaluation of content quality, readability, and actionability in cross-platform LLM-generated texts

Zhiqiang Wang, Xiaoya Li, Chao Ma, Zhiwen Zhang

PDF

Open Access

TL;DR

This study evaluates how well different AI platforms generate patient education materials for chronic heart failure, finding trade-offs between readability and information completeness.

Contribution

The paper introduces a framework for assessing LLM-generated patient education content and identifies platform-specific strengths and weaknesses.

Findings

01

Doubao and Kimi K2 produced the highest overall quality texts for patient education.

02

DeepSeek-R1 provided the most complete information but had the lowest readability.

03

ERNIEBot 4.5 Turbo and Qwen3-Max-Thinking-Preview were most readable but less comprehensive.

Abstract

To compare the differences in content quality, readability, and actionability of patient education texts for self-management of chronic heart failure (CHF) generated by five mainstream large language models (LLMs) in China, and to provide a basis for platform selection and assessment framework construction for clinical use. A standardized set of 20 questions was developed based on literature review, guidelines, and consensus from cardiovascular experts, covering disease awareness, diagnosis and classification, treatment and rehabilitation, daily management and prevention, and psychosocial dimensions. Using a uniform prompt, responses were generated by DeepSeek-R1, Doubao, ERNIEBot 4.5 Turbo, Qwen3-Max-Thinking-Preview, and Kimi K2. The PEMAT-P scale was used to assess understandability and actionability, 36-item expanded EQIP (EQIP-36 score) scale was used to evaluate information…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

CHF

Figures3

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Heart Failure Treatment and Management · Machine Learning in Healthcare