LOPT: Learning Optimal Pigovian Tax in Sequential Social Dilemmas
Yun Hua, Shang Gao, Wenhao Li, Haosheng Chen, Bo Jin, Xiangfeng Wang, Jun Luo, Hongyuan Zha

TL;DR
This paper introduces LOPT, a reinforcement learning method that learns optimal Pigovian taxes to internalize externalities, improving social welfare in multi-agent systems facing social dilemmas.
Contribution
The paper proposes LOPT, a novel approach that learns optimal externality-based taxes to mitigate social dilemmas in multi-agent reinforcement learning.
Findings
LOPT outperforms existing methods in Escape Room environment.
LOPT achieves higher collective social welfare in Cleanup environment.
The method effectively reduces social costs and alleviates social dilemmas.
Abstract
In multi-agent reinforcement learning, each agent acts to maximize its individual accumulated rewards. Nevertheless, individual accumulated rewards could not fully reflect how others perceive them, resulting in selfish behaviors that undermine global performance. The externality theory, defined as ``the activities of one economic actor affect the activities of another in ways that are not reflected in market transactions,'' is applicable to analyze the social dilemmas in MARL. One of its most profound non-market solutions, ``Pigovian Tax'', which internalizes externalities by taxing those who create negative externalities and subsidizing those who create positive externalities, could aid in developing a mechanism to resolve MARL's social dilemmas. The purpose of this paper is to apply externality theory to analyze social dilemmas in MARL. To internalize the externalities in MARL, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Experimental Behavioral Economics Studies · Economic theories and models
