Recurrent Auto-Encoders for Enhanced Deep Reinforcement Learning in   Wilderness Search and Rescue Planning

Jan-Hendrik Ewers; David Anderson; Douglas Thomson

arXiv:2502.19356·cs.LG·February 27, 2025

Recurrent Auto-Encoders for Enhanced Deep Reinforcement Learning in Wilderness Search and Rescue Planning

Jan-Hendrik Ewers, David Anderson, Douglas Thomson

PDF

Open Access

TL;DR

This paper introduces a novel combination of recurrent autoencoders and deep reinforcement learning to improve search efficiency in large wilderness rescue operations, outperforming existing methods in speed and accuracy.

Contribution

The study presents a new integrated architecture that enhances information throughput and control learning in complex, large-scale search tasks, with significant efficiency gains.

Findings

01

Proposed architecture outperforms benchmarks in search efficiency

02

Soft actor-critic achieves best performance among tested algorithms

03

Model trained faster with fewer parameters than previous approaches

Abstract

Wilderness search and rescue operations are often carried out over vast landscapes. The search efforts, however, must be undertaken in minimum time to maximize the chance of survival of the victim. Whilst the advent of cheap multicopters in recent years has changed the way search operations are handled, it has not solved the challenges of the massive areas at hand. The problem therefore is not one of complete coverage, but one of maximizing the information gathered in the limited time available. In this work we propose that a combination of a recurrent autoencoder and deep reinforcement learning is a more efficient solution to the search problem than previous pure deep reinforcement learning or optimisation approaches. The autoencoder training paradigm efficiently maximizes the information throughput of the encoder into its latent space representation which deep reinforcement learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · UAV Applications and Optimization · Robotic Path Planning Algorithms