Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI

Agnik Saha; Victoria Churchill; Anny D. Rodriguez; Ugur Kursuncu; Muhammed Y. Idris

arXiv:2505.10472·cs.CL·May 19, 2025

Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI

Agnik Saha, Victoria Churchill, Anny D. Rodriguez, Ugur Kursuncu, Muhammed Y. Idris

PDF

Open Access

TL;DR

This study evaluates various large language models' ability to generate accurate, safe, and accessible cancer-related information, revealing strengths in linguistic quality and accessibility but challenges in safety and bias mitigation.

Contribution

It provides a comprehensive evaluation of general-purpose and medical LLMs for cancer communication, highlighting their strengths and limitations in safety, trustworthiness, and accessibility.

Findings

01

General-purpose LLMs excel in linguistic quality and affectiveness.

02

Medical LLMs offer greater communication accessibility.

03

Medical LLMs show higher potential for harm, toxicity, and bias.

Abstract

Effective communication about breast and cervical cancers remains a persistent health challenge, with significant gaps in public understanding of cancer prevention, screening, and treatment, potentially leading to delayed diagnoses and inadequate treatments. This study evaluates the capabilities and limitations of Large Language Models (LLMs) in generating accurate, safe, and accessible cancer-related information to support patient understanding. We evaluated five general-purpose and three medical LLMs using a mixed-methods evaluation framework across linguistic quality, safety and trustworthiness, and communication accessibility and affectiveness. Our approach utilized quantitative metrics, qualitative expert ratings, and statistical analysis using Welch's ANOVA, Games-Howell, and Hedges' g. Our results show that general-purpose LLMs produced outputs of higher linguistic quality and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Mental Health via Writing