Deep Reinforcement Learning for Time-Critical Wilderness Search And Rescue Using Drones
Jan-Hendrik Ewers, David Anderson, Douglas Thomson

TL;DR
This paper presents a deep reinforcement learning approach for drone-based wilderness search and rescue, significantly improving search efficiency by learning optimal flight paths using probabilistic data.
Contribution
It introduces a novel deep reinforcement learning method with continuous action space for drone search path optimization in wilderness rescue scenarios.
Findings
Achieves over 160% faster search times compared to traditional methods.
Incorporates probabilistic area data to enhance search efficiency.
Utilizes continuous action space for more nuanced drone flight patterns.
Abstract
Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial. This paper explores the use of deep reinforcement learning to create efficient search missions for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the deep reinforcement learning agent to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms. In one comparison, deep reinforcement learning is found to outperform other algorithms by over , a difference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · UAV Applications and Optimization
