Inpatient Overflow Management with Proximal Policy Optimization
Jingjing Sun, Jim Dai, Pengyi Shi

TL;DR
This paper presents a scalable reinforcement learning framework using Proximal Policy Optimization to optimize inpatient overflow management in large hospital systems, addressing complex patient-unit assignment challenges.
Contribution
It introduces atomic actions, a partially-shared policy network, and queueing-informed value functions to improve scalability and efficiency in hospital overflow decision-making.
Findings
Outperforms existing benchmarks in large hospital systems.
Reduces simulation data requirements significantly.
Handles up to twenty patient classes and wards effectively.
Abstract
Problem Definition: Managing inpatient flow in large hospital systems is challenging due to the complexity of assigning randomly arriving patients -- either waiting for primary units or being overflowed to alternative units. Current practices rely on ad-hoc rules, while prior analytical approaches struggle with the intractably large state and action spaces inherent in patient-unit matching. Scalable decision support is needed to optimize overflow management while accounting for time-periodic fluctuations in patient flow. Methodology/Results: We develop a scalable decision-making framework using Proximal Policy Optimization (PPO) to optimize overflow decisions in a time-periodic, long-run average cost setting. To address the combinatorial complexity, we introduce atomic actions, which decompose multi-patient routing into sequential assignments. We further enhance computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
