From Data Lifting to Continuous Risk Estimation: A Process-Aware Pipeline for Predictive Monitoring of Clinical Pathways
Pasquale Ardimento, Mario Luca Bernardi, Marta Cimitile, Samuele Latorre

TL;DR
This paper introduces a process-aware pipeline for continuous predictive monitoring of clinical pathways, demonstrating improved early risk estimation as patient data accumulates.
Contribution
It presents a novel, reproducible framework integrating data lifting, temporal reconstruction, and predictive modeling for healthcare process monitoring.
Findings
Predictive performance improves as more clinical events are observed.
Logistic Regression achieves an AUC of 0.906 in predicting ICU admission.
Early-stage predictions have significantly lower accuracy, improving over pathway progression.
Abstract
This paper presents a reproducible and process-aware pipeline for predictive monitoring of clinical pathways. The approach integrates data lifting, temporal reconstruction, event log construction, prefix-based representations, and predictive modeling to support continuous reasoning on partially observed patient trajectories, overcoming the limitations of traditional retrospective process mining. The framework is evaluated on COVID-19 clinical pathways using ICU admission as the prediction target, considering 4,479 patient cases and 46,804 prefixes. Predictive models are trained and evaluated using a case-level split, with 896 patients in the test set. Logistic Regression achieves the best performance (AUC 0.906, F1-score 0.835). A detailed prefix-based analysis shows that predictive performance improves progressively as new clinical events become available, with AUC increasing from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
