Temporal Data Requirement for Predicting Unplanned Hospital Readmissions
Ramin Mohammadi, Vahab vahdat, Sarthak Jain, Amir T. Namin, Ramya Palacholla, Sagar Kamarthi

TL;DR
This study identifies optimal historical data time windows for predicting 30-day hospital readmissions after hip and knee surgeries, revealing modality-specific temporal patterns that challenge the assumption that more data always improves predictions.
Contribution
It provides new insights into the temporal data requirements for structured and unstructured clinical data, guiding better model design for readmission prediction.
Findings
Optimal note data window is 3-6 months prior to surgery.
Structured data performance plateaus after 12 months.
Modality-specific temporal patterns are consistent across models.
Abstract
With the proliferation of Electronic Health Records (EHRs), a critical challenge in building predictive models is determining the optimal historical data time window to maximize accuracy. This study investigates the impact of various observation windows ranging from the day of surgery to three years prior on predicting 30-day readmission following hip and knee arthroplasties. The dataset encompasses both structured encounter records (over 4 million) and unstructured clinical notes (80,000) from 7,174 patients. To extract meaning from the clinical notes, we employed a suite of non neural (BOW, count BOW, TF IDF, LDA) and neural encoders (BERT, 1D CNN, BiLSTM, Average). We subsequently evaluated models utilizing clinical notes alone, structured data alone, and a combination of both modalities. Our results demonstrate that the optimal time window for unstructured clinical notes is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
