Efficient Domain Coverage for Vehicles with Second-Order Dynamics via   Multi-Agent Reinforcement Learning

Xinyu Zhao; Razvan C. Fetecau; Mo Chen

arXiv:2211.05952·cs.RO·October 17, 2023

Efficient Domain Coverage for Vehicles with Second-Order Dynamics via Multi-Agent Reinforcement Learning

Xinyu Zhao, Razvan C. Fetecau, Mo Chen

PDF

Open Access

TL;DR

This paper introduces a multi-agent reinforcement learning approach using MAPPO with LSTM and self-attention for efficient area coverage by vehicles with second-order dynamics, outperforming classical control methods.

Contribution

The paper presents a novel RL-based method with a specialized network architecture for multi-agent coverage, handling variable agent numbers and surpassing traditional control policies.

Findings

01

RL approach outperforms classical control policies in simulations

02

Incorporation of LSTM and self-attention improves adaptability to agent number

03

Method demonstrates significant efficiency in simulated coverage tasks

Abstract

Collaborative autonomous multi-agent systems covering a specified area have many potential applications, such as UAV search and rescue, forest fire fighting, and real-time high-resolution monitoring. Traditional approaches for such coverage problems involve designing a model-based control policy based on sensor data. However, designing model-based controllers is challenging, and the state-of-the-art classical control policy still exhibits a large degree of sub-optimality. In this paper, we present a reinforcement learning (RL) approach for the multi-agent efficient domain coverage problem involving agents with second-order dynamics. Our approach is based on the Multi-Agent Proximal Policy Optimization Algorithm (MAPPO). Our proposed network architecture includes the incorporation of LSTM and self-attention, which allows the trained policy to adapt to a variable number of agents. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuel Cells and Related Materials · Reinforcement Learning in Robotics

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory