Collaborative Target Search with a Visual Drone Swarm: An Adaptive   Curriculum Embedded Multistage Reinforcement Learning Approach

Jiaping Xiao; Phumrapee Pisutsin; Mir Feroskhan

arXiv:2204.12181·cs.RO·November 28, 2023·1 cites

Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach

Jiaping Xiao, Phumrapee Pisutsin, Mir Feroskhan

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel adaptive curriculum multistage reinforcement learning approach for collaborative target search using visual drone swarms, enabling efficient training and real-world deployment without fine-tuning.

Contribution

It proposes ACEMSL, a data-efficient, multistage RL method with adaptive curriculum for collaborative drone search, addressing sparse rewards and visual perception challenges.

Findings

01

Effective in simulation and real-world tests

02

Enables deployment without fine-tuning

03

Improves collaboration and obstacle avoidance

Abstract

Equipping drones with target search capabilities is highly desirable for applications in disaster rescue and smart warehouse delivery systems. Multiple intelligent drones that can collaborate with each other and maneuver among obstacles show more effectiveness in accomplishing tasks in a shorter amount of time. However, carrying out collaborative target search (CTS) without prior target information is extremely challenging, especially with a visual drone swarm. In this work, we propose a novel data-efficient deep reinforcement learning (DRL) approach called adaptive curriculum embedded multistage learning (ACEMSL) to address these challenges, mainly 3-D sparse reward space exploration with limited visual perception and collaborative behavior requirements. Specifically, we decompose the CTS task into several subtasks including individual obstacle avoidance, target search, and inter-agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · UAV Applications and Optimization · Reinforcement Learning in Robotics