Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing
Amutheezan Sivagnanam, Ava Pettet, Hunter Lee, Ayan Mukhopadhyay,, Abhishek Dubey, Aron Laszka

TL;DR
This paper introduces a novel hierarchical reinforcement learning method with transformers for emergency responder stationing, significantly reducing decision time and slightly improving response times compared to existing approaches.
Contribution
It replaces online search with learned policies using transformers and actor-critic methods, enabling faster and effective emergency responder repositioning.
Findings
Reduces decision computation time by 1000x
Slightly decreases ambulance response time by 5 seconds
Validated on real-world data from Nashville and Seattle
Abstract
An emergency responder management (ERM) system dispatches responders, such as ambulances, when it receives requests for medical aid. ERM systems can also proactively reposition responders between predesignated waiting locations to cover any gaps that arise due to the prior dispatch of responders or significant changes in the distribution of anticipated requests. Optimal repositioning is computationally challenging due to the exponential number of ways to allocate responders between locations and the uncertainty in future requests. The state-of-the-art approach in proactive repositioning is a hierarchical approach based on spatial decomposition and online Monte Carlo tree search, which may require minutes of computation for each decision in a domain where seconds can save lives. We address the issue of long decision times by introducing a novel reinforcement learning (RL) approach, based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics
