# Promises and challenges of applying large language models in the healthcare domain

**Authors:** Qingyu Wang, Ziheng Gong, Zou Lai, Lina Bu, Fried-Michael Dahlweid, Hong Sun

PMC · DOI: 10.3389/fdgth.2026.1772274 · 2026-03-17

## TL;DR

This paper reviews how large language models are being used in healthcare, comparing general and specialized models and discussing their benefits and challenges.

## Contribution

The paper contrasts general-purpose and domain-specific models in healthcare and outlines future directions like retrieval-augmented generation.

## Key findings

- General-purpose models adapt to healthcare via prompt engineering, while domain-specific models align with medical knowledge graphs.
- Challenges include hallucination, privacy issues, and unclear evaluation metrics.
- Future routes include retrieval-augmented generation and agentic architectures.

## Abstract

Large language models are rapidly moving from theoretical concepts to active clinical pilots. Current approaches diverge between general-purpose models, which adapt to healthcare via prompt engineering, and domain-specific models, which prioritize deep alignment with medical knowledge graphs to ensure safety. Despite reported benefits in documentation efficiency and diagnostic reasoning, significant challenges remain regarding hallucination, privacy, and the validity of evaluation metrics. This Mini Review synthesizes current evidence, contrasts these two modeling paradigms, highlights key controversies, and maps out future development routes including retrieval-augmented generation and agentic architectures.

## Full-text entities

- **Diseases:** hallucination (MESH:D006212)

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13035794/full.md

---
Source: https://tomesphere.com/paper/PMC13035794