Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach
Maria Mahbub, Gregory M. Dams, Sudarshan Srinivasan, Caitlin Rizy,, Ioana Danciu, Jodie Trafton, Kathryn Knight

TL;DR
This paper explores using large language models with zero-shot learning to extract detailed substance use disorder severity information from clinical notes, addressing limitations of traditional NLP methods and enhancing clinical decision-making.
Contribution
It introduces a workflow leveraging LLMs with prompts and post-processing to improve extraction of SUD severity from unstructured clinical text, demonstrating superior recall over rule-based methods.
Findings
LLMs outperform rule-based approaches in recall for SUD severity extraction
Effective extraction across 11 SUD categories from clinical notes
Demonstrates potential for improved risk assessment and treatment planning
Abstract
Substance use disorder (SUD) poses a major concern due to its detrimental effects on health and society. SUD identification and treatment depend on a variety of factors such as severity, co-determinants (e.g., withdrawal symptoms), and social determinants of health. Existing diagnostic coding systems used by American insurance providers, like the International Classification of Diseases (ICD-10), lack granularity for certain diagnoses, but clinicians will add this granularity (as that found within the Diagnostic and Statistical Manual of Mental Disorders classification or DSM-5) as supplemental unstructured text in clinical notes. Traditional natural language processing (NLP) methods face limitations in accurately parsing such diverse clinical language. Large Language Models (LLMs) offer promise in overcoming these challenges by adapting to diverse language patterns. This study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterpreting and Communication in Healthcare · Mental Health via Writing · Machine Learning in Healthcare
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Flan-T5
