TL;DR
This paper introduces an approximate dynamic programming method using reinforcement learning to optimize elective patient admission, balancing waiting times and resource utilization efficiently in large-scale healthcare settings.
Contribution
It develops a novel ADP algorithm with structural analysis and reinforcement learning for large-scale MDPs in healthcare admission control.
Findings
Algorithms significantly reduce computation time.
Capable of near-optimal policy computation for large problems.
Effective in balancing patient wait times and resource use.
Abstract
In this paper, we propose an approximate dynamic programming (ADP) algorithm to solve a Markov decision process (MDP) formulation for the admission control of elective patients. To manage the elective patients from multiple specialties equitably and efficiently, we establish a waiting list and assign each patient a time-dependent dynamic priority score. Then, taking the random arrivals of patients into account, sequential decisions are made on a weekly basis. At the end of each week, we select the patients to be treated in the following week from the waiting list. By minimizing the cost function of the MDP over an infinite horizon, we seek to achieve the best trade-off between the patients' waiting times and the over-utilization of surgical resources. Considering the curses of dimensionality resulting from the large scale of realistically sized problems, we first analyze the structural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
