Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a   Swarm of Buoys

Maryam Kouzehgar; Malika Meghjani; Roland Bouffanais

arXiv:2012.11641·cs.RO·July 19, 2022

Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys

Maryam Kouzehgar, Malika Meghjani, Roland Bouffanais

PDF

TL;DR

This paper introduces two multi-agent reinforcement learning methods for dynamic ocean monitoring using a swarm of autonomous buoys, improving area coverage efficiency in non-stationary environments.

Contribution

It presents a novel structured MARL approach for marine area coverage, including a modified MADDPG method that enhances collective behavior and coverage performance.

Findings

01

Coverage-range-based MARL outperforms swarm-based MARL in convergence and coverage.

02

Both MARL methods outperform naive swarming strategies.

03

Structured learning improves adaptability in non-stationary environments.

Abstract

Autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem which can only be effectively carried out by a multi-robot system. In this paper, we focus on robotic swarms that are typically operated and controlled by means of simple swarming behaviors obtained from a subtle, yet ad hoc combination of bio-inspired strategies. We propose a novel and structured approach for area coverage using multi-agent reinforcement learning (MARL) which effectively deals with the non-stationarity of environmental features. Specifically, we propose two dynamic area coverage approaches: (1) swarm-based MARL, and (2) coverage-range-based MARL. The former is trained using the multi-agent deep deterministic policy gradient (MADDPG) approach whereas, a modified version of MADDPG is introduced for the latter with a reward function that intrinsically leads to a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.