Detecting clinically meaningful biomarkers with repeated measurements in an Electronic Health Record
Benjamin A Goldstein, Themistocles Assimes, Wolfgang C. Winkelmayer,, and Trevor Hastoe

TL;DR
This study develops a statistical method to identify meaningful biomarkers for acute myocardial infarction using irregular and sparse electronic health record data, demonstrating its practical utility with real-world data.
Contribution
Introduces a regression spline-based analytic approach for analyzing irregular EHR data to detect clinical biomarkers of acute events, without requiring specialized software.
Findings
EHR data can effectively identify biomarkers for acute MI.
The method works with irregular, sparse measurements without specialized tools.
Several laboratory markers showed significant predictive utility.
Abstract
Electronic health record (EHR) data are becoming an increasingly common data source for understanding clinical risk of acute events. While their longitudinal nature presents opportunities to observe changing risk over time, these analyses are complicated by the sparse and irregular measurements of many of the clinical metrics making typical statistical methods unsuitable for these data. In this paper, we present an analytic procedure to both sample from an EHR and analyze the data to detect clinically meaningful markers of acute myocardial infarction (MI). Using an EHR from a large national dialysis organization we abstracted the records of 64,318 individuals and identified 5,314 people that had an MI during the study period. We describe a nested case-control design to sample appropriate controls and an analytic approach using regression splines. Fitting a mixed-model with truncated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Statistical Methods in Clinical Trials · Biomedical Text Mining and Ontologies
