SoK: Privacy-aware LLM in Healthcare: Threat Model, Privacy Techniques, Challenges and Recommendations

Mohoshin Ara Tahera; Karamveer Singh Sidhu; Shuvalaxmi Dass; Sajal Saha

arXiv:2601.10004·cs.CR·January 16, 2026

SoK: Privacy-aware LLM in Healthcare: Threat Model, Privacy Techniques, Challenges and Recommendations

Mohoshin Ara Tahera, Karamveer Singh Sidhu, Shuvalaxmi Dass, Sajal Saha

PDF

Open Access

TL;DR

This paper systematically analyzes privacy threats and defenses for large language models in healthcare, highlighting challenges across data processing, fine-tuning, and inference phases, and proposing future research directions.

Contribution

It provides a comprehensive threat model and evaluates existing privacy-preserving techniques for LLMs in healthcare, identifying limitations and offering phase-aware recommendations.

Findings

01

Existing defenses have limitations in diverse healthcare settings.

02

Threat landscape varies across LLM phases: data, fine-tuning, inference.

03

Recommendations aim to enhance privacy guarantees in regulated environments.

Abstract

Large Language Models (LLMs) are increasingly adopted in healthcare to support clinical decision-making, summarize electronic health records (EHRs), and enhance patient care. However, this integration introduces significant privacy and security challenges, driven by the sensitivity of clinical data and the high-stakes nature of medical workflows. These risks become even more pronounced across heterogeneous deployment environments, ranging from small on-premise hospital systems to regional health networks, each with unique resource limitations and regulatory demands. This Systematization of Knowledge (SoK) examines the evolving threat landscape across the three core LLM phases: Data preprocessing, Fine-tuning, and Inference within realistic healthcare settings. We present a detailed threat model that characterizes adversaries, capabilities, and attack surfaces at each phase, and we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning