# Evaluating the Presence of Empathic Communication in ChatGPT-Produced Clinical Notes Using Established Communication Frameworks

**Authors:** Sydney Bowden, Keyline Moreno, Frederick Million, Nicholas Azinge

PMC · DOI: 10.7759/cureus.102750 · Cureus · 2026-01-31

## TL;DR

This study evaluates whether ChatGPT can produce empathetic clinical notes and finds that empathy is measurable but limited and formulaic.

## Contribution

The study introduces a novel evaluation of AI-generated clinical notes using established empathy frameworks to assess empathic communication.

## Key findings

- Empathetic prompting increased CARE scores significantly compared to neutral prompting.
- Empathic language in ChatGPT notes was mostly generic and lacked context-specific emotional nuance.
- The study highlights the potential and limitations of AI in generating patient-centered clinical documentation.

## Abstract

Background: Empathy is a core component of effective physician-patient communication and is associated with improved clinical relationships and patient experience. As generative artificial intelligence (AI) models such as ChatGPT (OpenAI, San Francisco, California, United States) are increasingly explored for clinical documentation support, it is important to understand whether these systems can produce language that reflects empathic communication.

Objective: This study evaluated empathic communication in ChatGPT-generated clinical notes through two distinct approaches: (i) quantitative measurement of linguistic markers using established communication frameworks, and (ii) qualitative characterization of empathic styles and patterns.

Methods: A cross-sectional simulation study was conducted using ChatGPT (large language model, web-based interface, December 2025 version 5.1). Twenty standardized pediatric cases were created across psychiatry and gastroenterology contexts. For each case, two Subjective, Objective, Assessment, and Plan (SOAP) format clinical notes were generated under different prompting conditions: a neutral clinical tone and an explicitly empathetic clinical tone. Notes were evaluated using the Consultation and Relational Empathy (CARE) Measure and the Empathic Communication Coding System (ECCS).

Results: Forty clinical notes were analyzed by two independent blinded raters. Notes generated under empathetic prompting demonstrated higher mean CARE scores compared with neutral notes (3.8 vs. 2.6 on a five-point scale), with differences found to be statistically significant (two-tailed independent samples t-test, p < 0.001). Empathetic-tone notes also contained a greater frequency of empathic statements as measured by the ECCS. Empathic language primarily reflected generic supportive phrasing, cognitive acknowledgment of patient concerns, and action-oriented reassurance, while context-specific emotional nuance remained limited across both prompting conditions.

Conclusions: ChatGPT can generate clinical documentation containing measurable expressions of empathy when explicitly prompted; however, this empathy remains largely formulaic and dependent on prompt design. These findings highlight both the potential utility and limitations of generative AI tools in producing patient-centered clinical documentation.

## Full-text entities

- **Diseases:** Crohn's disease (MESH:D003424), Neutral Clinical Tone (MESH:D009122), chronic disease (MESH:D002908), ECCS (MESH:D003147), abdominal discomfort (MESH:D000007), anxiety (MESH:D001007), fatigue (MESH:D005221)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12953172/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12953172/full.md

---
Source: https://tomesphere.com/paper/PMC12953172