The Design and Realization of Multi-agent Obstacle Avoidance based on Reinforcement Learning
Enyu Zhao, Chanjuan Liu, Houfu Su, Yang Liu

TL;DR
This paper introduces two improved multi-agent reinforcement learning algorithms, MADDPG-LSTMactor and MADDPG-L, enhancing obstacle avoidance in multi-agent systems by incorporating temporal information and scalability features.
Contribution
The paper proposes novel modifications to MADDPG, integrating LSTM for temporal data processing and simplifying critic inputs to improve scalability and performance.
Findings
MADDPG-LSTMactor outperforms in small-agent scenarios.
MADDPG-L shows better results with many agents.
Both algorithms demonstrate improved obstacle avoidance capabilities.
Abstract
Intelligence agents and multi-agent systems play important roles in scenes like the control system of grouped drones, and multi-agent navigation and obstacle avoidance which is the foundational function of advanced application has great importance. In multi-agent navigation and obstacle avoidance tasks, the decision-making interactions and dynamic changes of agents are difficult for traditional route planning algorithms or reinforcement learning algorithms with the increased complexity of the environment. The classical multi-agent reinforcement learning algorithm, Multi-agent deep deterministic policy gradient(MADDPG), solved precedent algorithms' problems of having unstationary training process and unable to deal with environment randomness. However, MADDPG ignored the temporal message hidden beneath agents' interaction with the environment. Besides, due to its CTDE technique which let…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics
