When Validation Fails: Cross-Institutional Blood Pressure Prediction and the Limits of Electronic Health Record-Based Models
Md Basit Azam, Sarangthem Ibotombi Singh

TL;DR
This study reveals significant challenges in applying blood pressure prediction models across different healthcare institutions, emphasizing the importance of external validation and identifying key barriers to generalizability.
Contribution
We developed an ensemble framework for blood pressure prediction from EHRs and systematically analyzed cross-institutional validation failures, highlighting fundamental barriers to model transferability.
Findings
Internal validation showed moderate performance.
External validation revealed substantial generalization issues.
Feature distribution and patient population differences hinder transferability.
Abstract
External validation remains rare in healthcare machine learning despite being critical for establishing real-world feasibility. We developed an ensemble framework to predict blood pressure from electronic health records, incorporating rigorous data leakage prevention. Internal validation on the MIMIC-III dataset yielded moderate performance for systolic (R^2 = 0.248, RMSE = 14.84 mmHg) and diastolic (R^2 = 0.297, RMSE = 8.27 mmHg) blood pressure. However, external validation on the eICU dataset revealed substantial generalization challenges. Baseline systolic performance dropped significantly from R^2 = 0.248 to -0.024, with RMSE increasing from 14.84 to 18.69 mmHg. To address potential confounding from feature imputation, we conducted an intersection-only experiment using 16 universally available features; this yielded worse external performance (R^2 = -0.115, RMSE = 17.32 mmHg),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
