Machine learning using longitudinal prescription and medical claims for the detection of nonalcoholic steatohepatitis (NASH)
Ozge Yasar, Patrick Long, Brett Harder, Hanna Marshall, Sanjay Bhasin,, Suyin Lee, Mark Delegge, Stephanie Roy, Orla Doyle, Nadea Leavitt, John Rigg

TL;DR
This study develops machine learning models using longitudinal medical claims data to identify undiagnosed NASH patients, addressing the challenge of underdiagnosis in clinical practice.
Contribution
It introduces novel gradient-boosted decision tree models trained on claims data to detect likely NASH patients among at-risk populations.
Findings
Model for all at-risk patients achieved AUPRC of 0.0107 and AUROC of 0.84.
Model for non-NAFL at-risk patients achieved AUPRC of 0.003 and AUROC of 0.78.
At 10% recall, precision was significantly above NASH incidence rates.
Abstract
Objectives To develop and evaluate machine learning models to detect suspected undiagnosed nonalcoholic steatohepatitis (NASH) patients for diagnostic screening and clinical management. Methods In this retrospective observational noninterventional study using administrative medical claims data from 1,463,089 patients, gradient-boosted decision trees were trained to detect likely NASH patients from an at-risk patient population with a history of obesity, type 2 diabetes mellitus (T2DM), metabolic disorder, or nonalcoholic fatty liver (NAFL). Models were trained to detect likely NASH in all at-risk patients or in the subset without a prior NAFL diagnosis (non-NAFL at-risk patients). Models were trained and validated using retrospective medical claims data and assessed using area under precision recall and receiver operating characteristic curves (AUPRCs, AUROCs). Results The 6-month…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLiver Disease Diagnosis and Treatment · Alcohol Consumption and Health Effects · Hepatitis C virus research
