Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle Avoidance
Zheng Fang, Tianhao Chen, Dong Jiang, Zheng Zhang, Guangliang Li

TL;DR
This paper introduces MAGAISIL, a novel learning method for multi-AUVs that improves formation control and obstacle avoidance by iteratively replacing sub-optimal demonstrations with self-generated trajectories, surpassing prior imitation learning methods.
Contribution
The paper proposes MAGAISIL, an enhanced multi-agent imitation learning algorithm that enables AUVs to improve policies beyond initial demonstrations through self-imitation and human guidance.
Findings
MAGAISIL outperforms MAGAIL with sub-optimal demonstrations.
AUV policies trained with MAGAISIL match or exceed those trained with optimal demonstrations.
The method adapts well to complex, varied tasks.
Abstract
Multiple autonomous underwater vehicles (multi-AUV) can cooperatively accomplish tasks that a single AUV cannot complete. Recently, multi-agent reinforcement learning has been introduced to control of multi-AUV. However, designing efficient reward functions for various tasks of multi-AUV control is difficult or even impractical. Multi-agent generative adversarial imitation learning (MAGAIL) allows multi-AUV to learn from expert demonstration instead of pre-defined reward functions, but suffers from the deficiency of requiring optimal demonstrations and not surpassing provided expert demonstrations. This paper builds upon the MAGAIL algorithm by proposing multi-agent generative adversarial interactive self-imitation learning (MAGAISIL), which can facilitate AUVs to learn policies by gradually replacing the provided sub-optimal demonstrations with self-generated good trajectories selected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Vehicles and Communication Systems · Reinforcement Learning in Robotics · Robot Manipulation and Learning
