Multi-Agent Reinforcement Learning with Hierarchical Coordination for   Emergency Responder Stationing

Amutheezan Sivagnanam; Ava Pettet; Hunter Lee; Ayan Mukhopadhyay,; Abhishek Dubey; Aron Laszka

arXiv:2405.13205·cs.LG·June 11, 2024

Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing

Amutheezan Sivagnanam, Ava Pettet, Hunter Lee, Ayan Mukhopadhyay,, Abhishek Dubey, Aron Laszka

PDF

Open Access

TL;DR

This paper introduces a novel hierarchical reinforcement learning method with transformers for emergency responder stationing, significantly reducing decision time and slightly improving response times compared to existing approaches.

Contribution

It replaces online search with learned policies using transformers and actor-critic methods, enabling faster and effective emergency responder repositioning.

Findings

01

Reduces decision computation time by 1000x

02

Slightly decreases ambulance response time by 5 seconds

03

Validated on real-world data from Nashville and Seattle

Abstract

An emergency responder management (ERM) system dispatches responders, such as ambulances, when it receives requests for medical aid. ERM systems can also proactively reposition responders between predesignated waiting locations to cover any gaps that arise due to the prior dispatch of responders or significant changes in the distribution of anticipated requests. Optimal repositioning is computationally challenging due to the exponential number of ways to allocate responders between locations and the uncertainty in future requests. The state-of-the-art approach in proactive repositioning is a hierarchical approach based on spatial decomposition and online Monte Carlo tree search, which may require minutes of computation for each decision in a domain where seconds can save lives. We address the issue of long decision times by introducing a novel reinforcement learning (RL) approach, based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics