Improved Reinforcement Learning in Cooperative Multi-agent Environments Using Knowledge Transfer
Mahnoosh Mahdavimoghaddam, Amin Nikanjam, Monireh Abdoos

TL;DR
This paper introduces a communication framework with a state reduction method and a knowledge transfer algorithm to improve reinforcement learning efficiency in cooperative multi-agent systems, demonstrated on a shepherding task.
Contribution
It presents a novel framework combining state space reduction and knowledge transfer to enhance learning speed and cooperation among agents in complex environments.
Findings
Accelerated learning process with knowledge transfer
Reduced state space size through state abstraction
Effective cooperation in multi-agent tasks
Abstract
Nowadays, cooperative multi-agent systems are used to learn how to achieve goals in large-scale dynamic environments. However, learning in these environments is challenging: from the effect of search space size on learning time to inefficient cooperation among agents. Moreover, reinforcement learning algorithms may suffer from a long time of convergence in such environments. In this paper, a communication framework is introduced. In the proposed communication framework, agents learn to cooperate effectively and also by introduction of a new state calculation method the size of state space will decline considerably. Furthermore, a knowledge-transferring algorithm is presented to share the gained experiences among the different agents, and develop an effective knowledge-fusing mechanism to fuse the knowledge learnt utilizing the agents' own experiences with the knowledge received from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Evolutionary Algorithms and Applications
