In-Context Learning for Preserving Patient Privacy: A Framework for Synthesizing Realistic Patient Portal Messages
Joseph Gatto, Parker Seegmiller, Timothy E. Burdick, Sarah Masud Preum

TL;DR
This paper presents an LLM-based framework for generating realistic, privacy-preserving synthetic patient portal messages to aid research and reduce clinician burnout, requiring minimal de-identification.
Contribution
The authors introduce a configurable, few-shot grounded text generation framework that produces high-quality, HIPAA-friendly synthetic patient messages, advancing privacy-preserving data sharing.
Findings
Generated data outperforms existing methods in quality
Framework is deemed HIPAA-friendly by clinical experts
Enables large-scale synthetic dataset creation
Abstract
Since the COVID-19 pandemic, clinicians have seen a large and sustained influx in patient portal messages, significantly contributing to clinician burnout. To the best of our knowledge, there are no large-scale public patient portal messages corpora researchers can use to build tools to optimize clinician portal workflows. Informed by our ongoing work with a regional hospital, this study introduces an LLM-powered framework for configurable and realistic patient portal message generation. Our approach leverages few-shot grounded text generation, requiring only a small number of de-identified patient portal messages to help LLMs better match the true style and tone of real data. Clinical experts in our team deem this framework as HIPAA-friendly, unlike existing privacy-preserving approaches to synthetic text generation which cannot guarantee all sensitive attributes will be protected.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectronic Health Records Systems · Mobile Health and mHealth Applications · Digital Mental Health Interventions
