The Effect of Epidemiological Cohort Creation on the Machine Learning Prediction of Homelessness and Police Interaction Outcomes Using Administrative Health Care Data
Faezehsadat Shahidi, M. Ethan MacDonald, Dallas Seitz, Geoffrey, Messier

TL;DR
This study demonstrates that using flexible, adaptive observation windows in cohort creation enhances the performance of machine learning models in predicting homelessness and police interactions among individuals with mental health issues, compared to fixed windows.
Contribution
It introduces a flexible cohort creation method that improves predictive accuracy of ML models for adverse outcomes in mental health populations.
Findings
XGBoost outperformed other models with 91% sensitivity and 90% AUC.
Flexible windows improved model performance over fixed observation periods.
Key risk factors include male sex, substance disorder, and psychiatric visits.
Abstract
Background: Mental illness can lead to adverse outcomes such as homelessness and police interaction and understanding of the events leading up to these adverse outcomes is important. Predictive models may help identify individuals at risk of such adverse outcomes. Using a fixed observation window cohort with logistic regression (LR) or machine learning (ML) models can result in lower performance when compared with adaptive and parcellated windows. Method: An administrative healthcare dataset was used, comprising of 240,219 individuals in Calgary, Alberta, Canada who were diagnosed with addiction or mental health (AMH) between April 1, 2013, and March 31, 2018. The cohort was followed for 2 years to identify factors associated with homelessness and police interactions. To understand the benefit of flexible windows to predictive models, an alternative cohort was created. Then LR and ML…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHomelessness and Social Issues · Urban, Neighborhood, and Segregation Studies
MethodsLogistic Regression
