Recurrent Auto-Encoders for Enhanced Deep Reinforcement Learning in Wilderness Search and Rescue Planning
Jan-Hendrik Ewers, David Anderson, Douglas Thomson

TL;DR
This paper introduces a novel combination of recurrent autoencoders and deep reinforcement learning to improve search efficiency in large wilderness rescue operations, outperforming existing methods in speed and accuracy.
Contribution
The study presents a new integrated architecture that enhances information throughput and control learning in complex, large-scale search tasks, with significant efficiency gains.
Findings
Proposed architecture outperforms benchmarks in search efficiency
Soft actor-critic achieves best performance among tested algorithms
Model trained faster with fewer parameters than previous approaches
Abstract
Wilderness search and rescue operations are often carried out over vast landscapes. The search efforts, however, must be undertaken in minimum time to maximize the chance of survival of the victim. Whilst the advent of cheap multicopters in recent years has changed the way search operations are handled, it has not solved the challenges of the massive areas at hand. The problem therefore is not one of complete coverage, but one of maximizing the information gathered in the limited time available. In this work we propose that a combination of a recurrent autoencoder and deep reinforcement learning is a more efficient solution to the search problem than previous pure deep reinforcement learning or optimisation approaches. The autoencoder training paradigm efficiently maximizes the information throughput of the encoder into its latent space representation which deep reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · UAV Applications and Optimization · Robotic Path Planning Algorithms
