Double Q-Learning for Citizen Relocation During Natural Hazards
Alysson Ribeiro da Silva

TL;DR
This paper explores using Double Q-learning within a POMDP framework to enable autonomous robots to relocate citizens during natural hazards, demonstrating variable success rates depending on scenario difficulty.
Contribution
It introduces a novel application of Double Q-learning for citizen relocation in disaster scenarios using a grid world simulation.
Findings
Success rate above 100% in easy scenarios
Near 50% success rate in hard scenarios
Demonstrates potential of RL in autonomous disaster response
Abstract
Natural disasters can cause substantial negative socio-economic impacts around the world, due to mortality, relocation, rates, and reconstruction decisions. Robotics has been successfully applied to identify and rescue victims during the occurrence of a natural hazard. However, little effort has been taken to deploy solutions where an autonomous robot can save the life of a citizen by itself relocating it, without the need to wait for a rescue team composed of humans. Reinforcement learning approaches can be used to deploy such a solution, however, one of the most famous algorithms to deploy it, the Q-learning, suffers from biased results generated when performing its learning routines. In this research a solution for citizen relocation based on Partially Observable Markov Decision Processes is adopted, where the capability of the Double Q-learning in relocating citizens during a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDisaster Response and Management · Facility Location and Emergency Management · Disaster Management and Resilience
MethodsDouble Q-learning · Q-Learning
