Inpatient Overflow Management with Proximal Policy Optimization

Jingjing Sun; Jim Dai; Pengyi Shi

arXiv:2410.13767·math.OC·May 8, 2026

Inpatient Overflow Management with Proximal Policy Optimization

Jingjing Sun, Jim Dai, Pengyi Shi

PDF

TL;DR

This paper presents a scalable reinforcement learning framework using Proximal Policy Optimization to optimize inpatient overflow management in large hospital systems, addressing complex patient-unit assignment challenges.

Contribution

It introduces atomic actions, a partially-shared policy network, and queueing-informed value functions to improve scalability and efficiency in hospital overflow decision-making.

Findings

01

Outperforms existing benchmarks in large hospital systems.

02

Reduces simulation data requirements significantly.

03

Handles up to twenty patient classes and wards effectively.

Abstract

Problem Definition: Managing inpatient flow in large hospital systems is challenging due to the complexity of assigning randomly arriving patients -- either waiting for primary units or being overflowed to alternative units. Current practices rely on ad-hoc rules, while prior analytical approaches struggle with the intractably large state and action spaces inherent in patient-unit matching. Scalable decision support is needed to optimize overflow management while accounting for time-periodic fluctuations in patient flow. Methodology/Results: We develop a scalable decision-making framework using Proximal Policy Optimization (PPO) to optimize overflow decisions in a time-periodic, long-run average cost setting. To address the combinatorial complexity, we introduce atomic actions, which decompose multi-patient routing into sequential assignments. We further enhance computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.