Zero-shot and Few-shot Generation Strategies for Artificial Clinical   Records

Erlend Frayling; Jake Lever; Graham McDonald

arXiv:2403.08664·cs.CL·March 15, 2024·1 cites

Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records

Erlend Frayling, Jake Lever, Graham McDonald

PDF

Open Access

TL;DR

This paper explores zero-shot and few-shot prompting strategies using Llama 2 to generate synthetic medical records, achieving high accuracy without training on sensitive patient data, thus addressing privacy concerns in clinical research.

Contribution

It introduces a novel chain-of-thought prompting technique that enables zero-shot LLMs to produce accurate synthetic medical narratives comparable to fine-tuned models.

Findings

01

Chain-of-thought prompting improves accuracy in synthetic record generation.

02

Zero-shot approach matches fine-tuned models in Rouge metrics.

03

Method enhances privacy-preserving data synthesis for medical research.

Abstract

The challenge of accessing historical patient data for clinical research, while adhering to privacy regulations, is a significant obstacle in medical science. An innovative approach to circumvent this issue involves utilising synthetic medical records that mirror real patient data without compromising individual privacy. The creation of these synthetic datasets, particularly without using actual patient data to train Large Language Models (LLMs), presents a novel solution as gaining access to sensitive patient information to train models is also a challenge. This study assesses the capability of the Llama 2 LLM to create synthetic medical records that accurately reflect real patient information, employing zero-shot and few-shot prompting strategies for comparison against fine-tuned methodologies that do require sensitive patient data during training. We focus on generating synthetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · AI in cancer detection

MethodsFocus