Benchmarking Machine Learning Architectures for Antimicrobial Stewardship in Pediatric ICUs
Niklas Raehse, Luregn J. Schlapbach, Daphn\'e Chopard

TL;DR
This study systematically benchmarks machine learning models for predicting antimicrobial stewardship interventions in pediatric ICUs, revealing that dataset characteristics and target design significantly influence performance.
Contribution
It provides a comprehensive evaluation of different temporal models and insights into model calibration, guiding future development of clinical decision support tools in pediatric AMS.
Findings
Sequence models improve precision-recall at coarse temporal resolution.
Model performance is mainly influenced by target prevalence and dataset features.
Simpler tabular models offer more reliable probability estimates.
Abstract
Antimicrobial stewardship (AMS) is critical in pediatric intensive care units (PICUs), where diagnostic uncertainty often drives broad-spectrum antibiotic use, increasing antimicrobial resistance and potential long-term harms. Machine learning offers a promising approach for identifying patient-level opportunities for stewardship interventions from electronic health record data, yet prior work has focused largely on adult populations and static tabular representations. We present a systematic benchmarking study of AMS intervention prediction in the PICU across a public dataset and a private institutional cohort. We define four clinically relevant proxy targets for reducing antibiotic exposure: intravenous-to-oral switching, de-escalation, discontinuation, and short-course therapy. Under a unified evaluation framework, we compare tabular, sequence-based, and graph-based temporal models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
