TL;DR
The paper introduces SepsisAI-Orchestrator, an open-source platform that enables scalable deployment and real-time monitoring of AI models for early sepsis detection in clinical settings.
Contribution
It provides a modular infrastructure integrating data preprocessing, model serving, and visualization, with empirical analysis of load scaling on performance.
Findings
Scaling replicas to match CPU threads reduces latency significantly.
Over-provisioning causes scheduler contention and performance degradation.
U-shaped scaling behavior in clinical AI inference workloads is quantified.
Abstract
Despite strong predictive results in the clinical machine learning literature, the translation of these models into bedside use remains limited by systems-level barriers: heterogeneous data representations, the absence of standardized deployment workflows, and a mismatch between research prototypes and the concurrency and latency requirements of hospital environments. We present the SepsisAI-Orchestrator, an open-source modular platform that addresses this deployment gap for early sepsis detection. The platform integrates HL7 FHIR-inspired Clinical Document Architecture (CDA) preprocessing, NoSQL storage, a containerized LightGBM classifier served via REST APIs, and a Streamlit clinical dashboard, orchestrated with Docker and Kubernetes. A previously validated LightGBM model (F1 0.87-0.94 on PhysioNet 2019) is reused without modification; the contribution lies in the surrounding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
