A Q-values Sharing Framework for Multiagent Reinforcement Learning under   Budget Constraint

Changxi Zhu; Ho-fung Leung; Shuyue Hu; Yi Cai

arXiv:2011.14281·cs.MA·November 10, 2021

A Q-values Sharing Framework for Multiagent Reinforcement Learning under Budget Constraint

Changxi Zhu, Ho-fung Leung, Shuyue Hu, Yi Cai

PDF

1 Repo

TL;DR

This paper introduces PSAF, a Q-values sharing framework for cooperative multiagent reinforcement learning that improves learning efficiency under communication budget constraints by enabling agents to decide when to ask for and share Q-values.

Contribution

The paper proposes a novel PSAF framework allowing agents to adaptively share and request Q-values, enhancing cooperation under limited communication budgets.

Findings

01

PSAF outperforms existing advising methods in various multiagent tasks.

02

Sharing Q-values improves learning efficiency and cooperation.

03

Adaptive advising reduces communication costs while maintaining performance.

Abstract

In teacher-student framework, a more experienced agent (teacher) helps accelerate the learning of another agent (student) by suggesting actions to take in certain states. In cooperative multiagent reinforcement learning (MARL), where agents need to cooperate with one another, a student may fail to cooperate well with others even by following the teachers' suggested actions, as the polices of all agents are ever changing before convergence. When the number of times that agents communicate with one another is limited (i.e., there is budget constraint), the advising strategy that uses actions as advices may not be good enough. We propose a partaker-sharer advising framework (PSAF) for cooperative MARL agents learning with budget constraint. In PSAF, each Q-learner can decide when to ask for Q-values and share its Q-values. We perform experiments in three typical multiagent learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jklimack/Multi-Agent-Reinforcement-Learning-for-the-Predator-Prey-Game
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.