Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach
Jiaping Xiao, Phumrapee Pisutsin, Mir Feroskhan

TL;DR
This paper introduces a novel adaptive curriculum multistage reinforcement learning approach for collaborative target search using visual drone swarms, enabling efficient training and real-world deployment without fine-tuning.
Contribution
It proposes ACEMSL, a data-efficient, multistage RL method with adaptive curriculum for collaborative drone search, addressing sparse rewards and visual perception challenges.
Findings
Effective in simulation and real-world tests
Enables deployment without fine-tuning
Improves collaboration and obstacle avoidance
Abstract
Equipping drones with target search capabilities is highly desirable for applications in disaster rescue and smart warehouse delivery systems. Multiple intelligent drones that can collaborate with each other and maneuver among obstacles show more effectiveness in accomplishing tasks in a shorter amount of time. However, carrying out collaborative target search (CTS) without prior target information is extremely challenging, especially with a visual drone swarm. In this work, we propose a novel data-efficient deep reinforcement learning (DRL) approach called adaptive curriculum embedded multistage learning (ACEMSL) to address these challenges, mainly 3-D sparse reward space exploration with limited visual perception and collaborative behavior requirements. Specifically, we decompose the CTS task into several subtasks including individual obstacle avoidance, target search, and inter-agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · UAV Applications and Optimization · Reinforcement Learning in Robotics
