Challenges of using generative AI for patient education in chronic heart failure: an evaluation of content quality, readability, and actionability in cross-platform LLM-generated texts
Zhiqiang Wang, Xiaoya Li, Chao Ma, Zhiwen Zhang

TL;DR
This study evaluates how well different AI platforms generate patient education materials for chronic heart failure, finding trade-offs between readability and information completeness.
Contribution
The paper introduces a framework for assessing LLM-generated patient education content and identifies platform-specific strengths and weaknesses.
Findings
Doubao and Kimi K2 produced the highest overall quality texts for patient education.
DeepSeek-R1 provided the most complete information but had the lowest readability.
ERNIEBot 4.5 Turbo and Qwen3-Max-Thinking-Preview were most readable but less comprehensive.
Abstract
To compare the differences in content quality, readability, and actionability of patient education texts for self-management of chronic heart failure (CHF) generated by five mainstream large language models (LLMs) in China, and to provide a basis for platform selection and assessment framework construction for clinical use. A standardized set of 20 questions was developed based on literature review, guidelines, and consensus from cardiovascular experts, covering disease awareness, diagnosis and classification, treatment and rehabilitation, daily management and prevention, and psychosocial dimensions. Using a uniform prompt, responses were generated by DeepSeek-R1, Doubao, ERNIEBot 4.5 Turbo, Qwen3-Max-Thinking-Preview, and Kimi K2. The PEMAT-P scale was used to assess understandability and actionability, 36-item expanded EQIP (EQIP-36 score) scale was used to evaluate information…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Heart Failure Treatment and Management · Machine Learning in Healthcare
