Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing
Yue Jin, Shuangqing Wei, Giovanni Montana

TL;DR
This paper introduces a novel multi-agent reinforcement learning method where agents share action suggestions to promote collective welfare, reducing privacy concerns and avoiding complex reward design, supported by theoretical analysis and competitive experiments.
Contribution
The paper proposes a new MARL approach based on sharing action suggestions, offering an alternative to reward or policy sharing with theoretical guarantees.
Findings
Effective cooperation achieved without sharing rewards or policies
Theoretical bounds on the discrepancy between individual and collective objectives
Competitive performance demonstrated in experiments
Abstract
In human society, the conflict between self-interest and collective well-being often obstructs efforts to achieve shared welfare. Related concepts like the Tragedy of the Commons and Social Dilemmas frequently manifest in our daily lives. As artificial agents increasingly serve as autonomous proxies for humans, we propose a novel multi-agent reinforcement learning (MARL) method to address this issue - learning policies to maximise collective returns even when individual agents' interests conflict with the collective one. Unlike traditional cooperative MARL solutions that involve sharing rewards, values, and policies or designing intrinsic rewards to encourage agents to learn collectively optimal policies, we propose a novel MARL approach where agents exchange action suggestions. Our method reveals less private information compared to sharing rewards, values, or policies, while enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation
MethodsALIGN
