Deep reinforcement learning with a particle dynamics environment applied   to emergency evacuation of a room with obstacles

Yihao Zhang; Zhaojie Chai; George Lykotrafitis

arXiv:2012.00065·cs.LG·March 9, 2021

Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles

Yihao Zhang, Zhaojie Chai, George Lykotrafitis

PDF

TL;DR

This paper develops a deep reinforcement learning approach combined with the social-force model to optimize emergency evacuation strategies in complex environments with obstacles, outperforming traditional models in certain scenarios.

Contribution

It introduces a novel Dyna-Q based reinforcement learning method integrated with social-force modeling to improve evacuation planning in complex obstacle-laden environments.

Findings

01

The method aligns with social-force results in obstacle-free scenarios.

02

It effectively avoids traps in concave obstacle environments.

03

Agents efficiently evacuate through multiple exits using a shared policy.

Abstract

A very successful model for simulating emergency evacuation is the social-force model. At the heart of the model is the self-driven force that is applied to an agent and is directed towards the exit. However, it is not clear if the application of this force results in optimal evacuation, especially in complex environments with obstacles. Here, we develop a deep reinforcement learning algorithm in association with the social force model to train agents to find the fastest evacuation path. During training, we penalize every step of an agent in the room and give zero reward at the exit. We adopt the Dyna-Q learning approach. We first show that in the case of a room without obstacles the resulting self-driven force points directly towards the exit as in the social force model and that the median exit time intervals calculated using the two methods are not significantly different. Then, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.