Towards Conditioning Clinical Text Generation for User Control

Osman Alperen Kora\c{s}; Rabi Bahnan; Jens Kleesiek; Amin Dada

arXiv:2502.17571·cs.CL·February 26, 2025

Towards Conditioning Clinical Text Generation for User Control

Osman Alperen Kora\c{s}, Rabi Bahnan, Jens Kleesiek, Amin Dada

PDF

Open Access 1 Video

TL;DR

This paper presents a method using large language models to improve control over clinical text generation, reducing hallucinations and factual errors without increasing clinician workload, and achieves state-of-the-art results on a relevant shared task.

Contribution

It introduces automated dataset augmentation with LLMs as human proxies to enhance clinician control in clinical text generation, achieving significant performance improvements.

Findings

01

9% relative improvement without augmented training

02

up to 34% improvement with dataset augmentation

03

Preliminary human evaluation supports effectiveness

Abstract

Deploying natural language generation systems in clinical settings remains challenging despite advances in Large Language Models (LLMs), which continue to exhibit hallucinations and factual inconsistencies, necessitating human oversight. This paper explores automated dataset augmentation using LLMs as human proxies to condition LLMs for clinician control without increasing cognitive workload. On the BioNLP ACL'24 Discharge Me! Shared Task, we achieve new state-of-the-art results with simpler methods than prior submissions through more efficient training, yielding a 9\% relative improvement without augmented training and up to 34\% with dataset augmentation. Preliminary human evaluation further supports the effectiveness of our approach, highlighting the potential of augmenting clinical text generation for control to enhance relevance, accuracy, and factual consistency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Conditioning Clinical Text Generation for User Control· underline

Taxonomy

TopicsSemantic Web and Ontologies