A Novel Data-Driven Framework for Risk Characterization and Prediction from Electronic Medical Records: A Case Study of Renal Failure
Prithwish Chakraborty, Vishrawas Gopalakrishnan, Sharon M.H., Alford, Faisal Farooq

TL;DR
This paper introduces a scalable, data-driven framework that analyzes electronic medical records to identify and validate risk factors for renal failure in diabetic patients, uncovering both known and novel predictors without expert guidance.
Contribution
The paper presents a novel, disease-agnostic framework that systematically uncovers important risk factors from EMR data using regularized survival models, validated through outcome prediction.
Findings
Identified risk factors overlap with expert-recognized characteristics.
Framework effectively predicts renal failure within specified time windows.
Method uncovers new hypotheses for disease risk factors.
Abstract
Electronic medical records (EMR) contain longitudinal information about patients that can be used to analyze outcomes. Typically, studies on EMR data have worked with established variables that have already been acknowledged to be associated with certain outcomes. However, EMR data may also contain hitherto unrecognized factors for risk association and prediction of outcomes for a disease. In this paper, we present a scalable data-driven framework to analyze EMR data corpus in a disease agnostic way that systematically uncovers important factors influencing outcomes in patients, as supported by data and without expert guidance. We validate the importance of such factors by using the framework to predict for the relevant outcomes. Specifically, we analyze EMR data covering approximately 47 million unique patients to characterize renal failure (RF) among type 2 diabetic (T2DM) patients.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Chronic Disease Management Strategies · Artificial Intelligence in Healthcare
