Scalable Medication Extraction and Discontinuation Identification from Electronic Health Records Using Large Language Models
Chong Shao, Douglas Snyder, Chiran Li, Bowen Gu, Kerry Ngan, Chun-Ting Yang, Jiageng Wu, Richard Wyss, Kueiyu Joshua Lin, Jie Yang

TL;DR
This study evaluates large language models (LLMs) for extracting medication information and identifying discontinuations from EHR notes. It reports strong performance from GPT-4o and from open-source models, and argues that the approach scales without extensive human annotation.
Contribution
It systematically compares multiple LLMs for medication extraction and discontinuation classification from EHRs, highlighting the potential of open-source models as scalable alternatives.
Findings
GPT-4o achieved the highest F1 scores: 94.0% on extraction and 78.1% on classification.
Open-source Llama-3.1-70B-Instruct performed well in classification tasks.
Few-shot learning improved model performance, while chain-of-thought (CoT) reasoning had mixed effects.
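To make the F1 figures above concrete, here is a minimal sketch of how a set-based F1 score could be computed for extraction and for the joint extraction-then-classification task. It is an illustration under stated assumptions, not the authors' evaluation code: this page does not describe the paper's matching criteria, and the `f1_score` helper, the status labels ("active", "discontinued"), and the example annotations are all hypothetical.

```python
from typing import Hashable, Iterable


def f1_score(predicted: Iterable[Hashable], gold: Iterable[Hashable]) -> float:
    """Set-based F1: harmonic mean of precision and recall over exact matches."""
    pred_set, gold_set = set(predicted), set(gold)
    if not pred_set and not gold_set:
        return 1.0  # nothing to find and nothing predicted
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical annotations for one note: (medication, status) pairs.
gold = {
    ("metformin", "active"),
    ("warfarin", "discontinued"),
    ("lisinopril", "active"),
}
# Hypothetical model output for the same note.
pred = {
    ("metformin", "active"),
    ("warfarin", "active"),  # medication found, status wrong
    ("aspirin", "active"),   # medication not in the gold set
}

# Extraction: compare medication names only.
extraction_f1 = f1_score({med for med, _ in pred}, {med for med, _ in gold})

# Joint task: a prediction counts only if both name and status match.
joint_f1 = f1_score(pred, gold)

print(f"extraction F1 = {extraction_f1:.3f}, joint F1 = {joint_f1:.3f}")
```

In this toy case the joint score is lower than the extraction score because a correctly extracted medication with the wrong status counts as an error on the joint task.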
Abstract
Identifying medication discontinuations in electronic health records (EHRs) is vital for patient safety but is often hindered by information being buried in unstructured notes. This study aims to evaluate the capabilities of advanced open-source and proprietary large language models (LLMs) in extracting medications and classifying their status from EHR notes, focusing on their scalability for medication information extraction without human annotation. We collected three EHR datasets from diverse sources to build the evaluation benchmark. We evaluated 12 advanced LLMs and explored multiple LLM prompting strategies. Performance on medication extraction, medication status classification, and their joint task (extraction then classification) was systematically compared across all experiments. We found that LLMs showed promising performance on the medication extraction and…
Taxonomy
Topics: Biomedical Text Mining and Ontologies · Topic Modeling · Data Quality and Management
