Multi-Agent Generative Adversarial Interactive Self-Imitation Learning   for AUV Formation Control and Obstacle Avoidance

Zheng Fang; Tianhao Chen; Dong Jiang; Zheng Zhang; Guangliang Li

arXiv:2401.11378·cs.RO·January 23, 2024·1 cites

Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle Avoidance

Zheng Fang, Tianhao Chen, Dong Jiang, Zheng Zhang, Guangliang Li

PDF

Open Access

TL;DR

This paper introduces MAGAISIL, a novel learning method for multi-AUVs that improves formation control and obstacle avoidance by iteratively replacing sub-optimal demonstrations with self-generated trajectories, surpassing prior imitation learning methods.

Contribution

The paper proposes MAGAISIL, an enhanced multi-agent imitation learning algorithm that enables AUVs to improve policies beyond initial demonstrations through self-imitation and human guidance.

Findings

01

MAGAISIL outperforms MAGAIL with sub-optimal demonstrations.

02

AUV policies trained with MAGAISIL match or exceed those trained with optimal demonstrations.

03

The method adapts well to complex, varied tasks.

Abstract

Multiple autonomous underwater vehicles (multi-AUV) can cooperatively accomplish tasks that a single AUV cannot complete. Recently, multi-agent reinforcement learning has been introduced to control of multi-AUV. However, designing efficient reward functions for various tasks of multi-AUV control is difficult or even impractical. Multi-agent generative adversarial imitation learning (MAGAIL) allows multi-AUV to learn from expert demonstration instead of pre-defined reward functions, but suffers from the deficiency of requiring optimal demonstrations and not surpassing provided expert demonstrations. This paper builds upon the MAGAIL algorithm by proposing multi-agent generative adversarial interactive self-imitation learning (MAGAISIL), which can facilitate AUVs to learn policies by gradually replacing the provided sub-optimal demonstrations with self-generated good trajectories selected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUnderwater Vehicles and Communication Systems · Reinforcement Learning in Robotics · Robot Manipulation and Learning