Predicting first-episode homelessness among US Veterans using longitudinal EHR data: time-varying models and social risk factors
Rohan Pandey, Haijuan Yan, Hong Yu, Jack Tsai

TL;DR
This study develops longitudinal EHR-based models incorporating social factors to predict first-episode homelessness among US veterans, enabling targeted interventions with improved accuracy over traditional methods.
Contribution
It introduces a novel approach combining static and time-varying EHR representations with social risk factors, comparing classical ML and large language models for homelessness prediction.
Findings
Incorporating social factors improved PR-AUC by 15-30%.
Top 1% risk models achieved PPV up to 13.80%.
Large language models showed smaller racial disparities.
Abstract
Homelessness among US veterans remains a critical public health challenge, yet risk prediction offers a pathway for proactive intervention. In this retrospective prognostic study, we analyzed electronic health record (EHR) data from 4,276,403 Veterans Affairs patients during a 2016 observation period to predict first-episode homelessness occurring 3-12 months later in 2017 (prevalence: 0.32-1.19%). We constructed static and time-varying EHR representations, utilizing clinician-informed logic to model the persistence of clinical conditions and social risks over time. We then compared the performance of classical machine learning, transformer-based masked language models, and fine-tuned large language models (LLMs). We demonstrate that incorporating social and behavioral factors into longitudinal models improved precision-recall area under the curve (PR-AUC) by 15-30%. In the top 1% risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHomelessness and Social Issues · Food Security and Health in Diverse Populations · Migration, Health and Trauma
