Online bipartite matching methodology for anti-epidemic resources allocation: an adaptive time window based on reinforcement learning
Zhiyong Wu, Sulin Pang, Suyan He

TL;DR
This paper proposes a reinforcement learning-based method to dynamically allocate anti-epidemic resources among suppliers and recipients during outbreaks.
Contribution
The novel contribution is an adaptive time window bipartite matching algorithm using reinforcement learning for dynamic anti-epidemic resource allocation.
Findings
Adaptive time window strategies better adapt to dynamic epidemic scenarios compared to fixed window approaches.
Matching rates increase with larger windows, but waiting times initially decrease then increase.
Health managers should adjust time windows based on epidemic dynamics and resource availability.
Abstract
This study aimed to investigate the online matching problem for anti-epidemic resources among multiple suppliers and recipients in the Internet of Healthcare System during a major outbreak. It accounts for the heterogeneity of supply and demand. A multi-stage online dynamic bipartite matching model based on time windows is developed, which can be reformulated as a Markov decision process. An adaptive time window batch bipartite matching algorithm based on reinforcement learning is proposed, which utilizes the nearest neighbor's first heuristic strategy to allocate anti-epidemic resources. The optimal window size in fixed time window batch matching strategy (FTWBM) outperforms the results of adaptive time window batch matching strategy (ATWBM). However, the ATWBM strategy demonstrates greater effectiveness in adapting to the dynamic changes in epidemic prevention and control,…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Advanced Queuing Theory Analysis · Supply Chain and Inventory Management
