High-Throughput Machine Learning from Electronic Health Records
Ross S. Kleiman, Paul S. Bennett, Peggy L. Peissig, Richard L. Berg,, Zhaobin Kuang, Scott J. Hebbring, Michael D. Caldwell, David Page

TL;DR
This study demonstrates that machine learning can accurately predict risks for thousands of diagnoses from electronic health records, providing a comprehensive patient risk profile and a new dataset for clinical prediction research.
Contribution
It introduces a novel approach for simultaneous prediction of risks across thousands of diagnosis codes using EHR data, achieving high predictive performance.
Findings
Average AUCs of 0.803 and 0.758 for 1 and 6 months predictions
High performance across thousands of diagnosis prediction tasks
Provides a new dataset for clinical risk prediction research
Abstract
The widespread digitization of patient data via electronic health records (EHRs) has created an unprecedented opportunity to use machine learning algorithms to better predict disease risk at the patient level. Although predictive models have previously been constructed for a few important diseases, such as breast cancer and myocardial infarction, we currently know very little about how accurately the risk for most diseases or events can be predicted, and how far in advance. Machine learning algorithms use training data rather than preprogrammed rules to make predictions and are well suited for the complex task of disease prediction. Although there are thousands of conditions and illnesses patients can encounter, no prior research simultaneously predicts risks for thousands of diagnosis codes and thereby establishes a comprehensive patient risk profile. Here we show that such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Explainable Artificial Intelligence (XAI)
