Optimal Dispatch in Emergency Service System via Reinforcement Learning

Cheng Hua; Tauhid Zaman

arXiv:2010.07513 · eess.SY · October 16, 2020 · 1 citation

Open Access

TL;DR

This paper models ambulance dispatch as an average-cost Markov decision process and applies temporal-difference learning on post-decision states to obtain dispatch policies that outperform a myopic benchmark policy.

Contribution

It reformulates the ambulance dispatch problem using post-decision states, which is mathematically equivalent to the original model but has a much smaller state space, and presents a temporal-difference learning approach based on that formulation.

Findings

01 Temporal-difference policy outperforms myopic policy
02 Proposed methods improve emergency response efficiency
03 Minimal cost required for performance enhancement

Abstract

In the United States, medical responses by fire departments have increased by 367% over the last four decades. This has made it critical for decision makers in emergency response departments to use existing resources efficiently. In this paper, we model the ambulance dispatch problem as an average-cost Markov decision process and present a policy iteration approach to find an optimal dispatch policy. We then propose an alternative formulation using post-decision states that is shown to be mathematically equivalent to the original model, but with a much smaller state space. We present a temporal-difference learning approach to the dispatch problem based on the post-decision states. In our numerical experiments, we show that the resulting temporal-difference policy outperforms the benchmark myopic policy. Our findings suggest that emergency response departments can improve their…
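
To make the post-decision idea concrete, here is a minimal Python sketch of average-cost TD(0) learning on a toy dispatch problem. It is not the authors' implementation. The instance (two ambulances, two regions), the discrete-time dynamics, and every name and number in it (`RESPONSE_TIME`, `CALL_PROB`, `FINISH_PROB`, `LOST_CALL_COST`, `post_state`, `dispatch`, `td_learn`) are assumptions made for illustration. The post-decision state is the busy/idle vector right after a unit is assigned, so the value table needs one entry per status vector rather than one per (status, call region) pair.

```python
import random
from itertools import product

# Hypothetical toy instance; all numbers are illustrative, not from the paper.
RESPONSE_TIME = [[1.0, 4.0],   # unit 0 -> region 0, region 1
                 [4.0, 1.0]]   # unit 1 -> region 0, region 1
CALL_PROB = [0.7, 0.3]         # chance that the next call comes from each region
FINISH_PROB = 0.4              # chance per step that a busy unit becomes idle
LOST_CALL_COST = 10.0          # assumed penalty when every unit is busy


def post_state(status, unit):
    """Status vector right after `unit` is sent out (1 = busy, 0 = idle)."""
    return tuple(1 if u == unit else b for u, b in enumerate(status))


def dispatch(status, region, V):
    """Choose the idle unit minimising response time plus post-decision value.

    Returns (unit, immediate cost, post-decision state).
    """
    idle = [u for u, busy in enumerate(status) if not busy]
    if not idle:
        return None, LOST_CALL_COST, status
    best = min(idle, key=lambda u: RESPONSE_TIME[u][region] + V[post_state(status, u)])
    return best, RESPONSE_TIME[best][region], post_state(status, best)


def td_learn(steps=20000, alpha=0.05, beta=0.01, seed=0):
    """Average-cost TD(0) on post-decision states; returns (V, average cost)."""
    rng = random.Random(seed)
    n = len(RESPONSE_TIME)
    V = {s: 0.0 for s in product((0, 1), repeat=n)}  # one entry per status vector
    avg_cost = 0.0
    post = (0,) * n
    for _ in range(steps):
        # Exogenous events: busy units may finish, then a call arrives.
        status = tuple(int(b and rng.random() >= FINISH_PROB) for b in post)
        region = rng.choices(range(len(CALL_PROB)), weights=CALL_PROB)[0]
        _, cost, new_post = dispatch(status, region, V)
        # TD error relative to the running average cost.
        delta = cost - avg_cost + V[new_post] - V[post]
        V[post] += alpha * delta
        avg_cost += beta * (cost - avg_cost)
        post = new_post
    return V, avg_cost
```

Calling `td_learn()` returns the learned value table and a running estimate of the average cost per call. Passing a table of zeros to `dispatch` reduces it to a myopic rule that always sends the closest idle unit, which gives a simple baseline to compare against in this toy setting.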

Taxonomy

Topics: Facility Location and Emergency Management · Evacuation and Crowd Dynamics · Homelessness and Social Issues