Selective Exploration and Information Gathering in Search and Rescue Using Hierarchical Learning Guided by Natural Language Input
Dimitrios Panagopoulos, Adolfo Perrusquia, Weisi Guo

TL;DR
This paper presents a hierarchical reinforcement learning system guided by natural language input, enabling robots to efficiently explore disaster areas in search and rescue missions by integrating human-provided information.
Contribution
It introduces a novel framework combining large language models with hierarchical RL to incorporate human input and improve search efficiency in complex environments.
Findings
Enhanced learning efficiency in SAR tasks
Improved decision-making with human guidance
Effective integration of LLMs with RL
Abstract
In recent years, robots and autonomous systems have become increasingly integral to our daily lives, offering solutions to complex problems across various domains. Their application in search and rescue (SAR) operations, however, presents unique challenges. Comprehensively exploring the disaster-stricken area is often infeasible due to the vastness of the terrain, transformed environment, and the time constraints involved. Traditional robotic systems typically operate on predefined search patterns and lack the ability to incorporate and exploit ground truths provided by human stakeholders, which can be the key to speeding up the learning process and enhancing triage. Addressing this gap, we introduce a system that integrates social interaction via large language models (LLMs) with a hierarchical reinforcement learning (HRL) framework. The proposed system is designed to translate verbal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Text and Document Classification Technologies
