Cooperative and Distributed Reinforcement Learning of Drones for Field   Coverage

Huy Xuan Pham; Hung Manh La; David Feil-Seifer; Aria Nefian

arXiv:1803.07250·cs.RO·September 18, 2018·70 cites

Cooperative and Distributed Reinforcement Learning of Drones for Field Coverage

Huy Xuan Pham, Hung Manh La, David Feil-Seifer, Aria Nefian

PDF

Open Access

TL;DR

This paper introduces a distributed MARL algorithm enabling UAV teams to cooperatively cover unknown fields efficiently, addressing joint-action complexity with game theory and high-dimensional states with function approximation, validated through simulations and real-world tests.

Contribution

It presents a novel MARL approach combining game-theoretic correlated equilibrium and function approximation for UAV coverage tasks, with experimental validation.

Findings

01

UAV teams successfully learn to cover fields with minimal overlap.

02

The algorithm performs well in both simulation and physical experiments.

03

Cooperative learning improves coverage efficiency and reduces redundancy.

Abstract

This paper proposes a distributed Multi-Agent Reinforcement Learning (MARL) algorithm for a team of Unmanned Aerial Vehicles (UAVs). The proposed MARL algorithm allows UAVs to learn cooperatively to provide a full coverage of an unknown field of interest while minimizing the overlapping sections among their field of views. Two challenges in MARL for such a system are discussed in the paper: firstly, the complex dynamic of the joint-actions of the UAV team, that will be solved using game-theoretic correlated equilibrium, and secondly, the challenge in huge dimensional state space representation will be tackled with efficient function approximation techniques. We also provide our experimental results in detail with both simulation and physical implementation to show that the UAV team can successfully learn to accomplish the task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Reinforcement Learning in Robotics · UAV Applications and Optimization