Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys
Maryam Kouzehgar, Malika Meghjani, Roland Bouffanais

TL;DR
This paper introduces two multi-agent reinforcement learning methods for dynamic ocean monitoring using a swarm of autonomous buoys, improving area coverage efficiency in non-stationary environments.
Contribution
It presents a novel structured MARL approach for marine area coverage, including a modified MADDPG method that enhances collective behavior and coverage performance.
Findings
Coverage-range-based MARL outperforms swarm-based MARL in convergence and coverage.
Both MARL methods outperform naive swarming strategies.
Structured learning improves adaptability in non-stationary environments.
Abstract
Autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem which can only be effectively carried out by a multi-robot system. In this paper, we focus on robotic swarms that are typically operated and controlled by means of simple swarming behaviors obtained from a subtle, yet ad hoc combination of bio-inspired strategies. We propose a novel and structured approach for area coverage using multi-agent reinforcement learning (MARL) which effectively deals with the non-stationarity of environmental features. Specifically, we propose two dynamic area coverage approaches: (1) swarm-based MARL, and (2) coverage-range-based MARL. The former is trained using the multi-agent deep deterministic policy gradient (MADDPG) approach whereas, a modified version of MADDPG is introduced for the latter with a reward function that intrinsically leads to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
