Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router
Akul Goel, Surya Narayanan Hari, Belinda Waltman, Matt Thomson

TL;DR
This paper presents an intelligent routing system that directs clinical notes to open-source large language models optimized for specific Social Determinants of Health coding tasks, achieving state-of-the-art accuracy and addressing privacy concerns.
Contribution
The authors introduce a novel intelligent routing architecture that leverages multiple open-source LLMs for SDOH coding, outperforming closed models like GPT-4o.
Findings
Achieved 96.4% accuracy across 13 SDOH codes.
Outperformed closed-source models such as GPT-4o.
Utilized synthetic data generation to enhance training without privacy risks.
Abstract
Social Determinants of Health (SDOH), also known as Health-Related Social Needs (HSRN), play a significant role in patient health outcomes. The Centers for Disease Control and Prevention (CDC) introduced a subset of ICD-10 codes called Z-codes to recognize and measure SDOH. However, Z-codes are infrequently coded in a patient's Electronic Health Record (EHR), and instead, in many cases, need to be inferred from clinical notes. Previous research has shown that large language models (LLMs) show promise on extracting unstructured data from EHRs, but it can be difficult to identify a single model that performs best on varied coding tasks. Further, clinical notes contain protected health information posing a challenge for the use of closed-source language models from commercial vendors. The identification of open-source LLMs that can be run within health organizations and exhibit high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing
MethodsSparse Evolutionary Training
