LMFPPO-UBP: Local Mean Field Proximal Policy Optimization with Unbalanced Punishment for Spatial Public Goods Games
Jinshuo Yang, Zhaoqilin Yang, Wenjie Zhou, Xin Wang, Youliang Tian

TL;DR
This paper introduces LMFPPO-UBP, a novel reinforcement learning framework that enhances cooperation in spatial public goods games by incorporating local mean-field dynamics and unbalanced punishment mechanisms.
Contribution
It reformulates the mean field as a socio-statistical sensor within policy gradients and integrates unbalanced punishment to effectively promote cooperation.
Findings
Outperforms baseline methods like Q-learning and Fermi update rules.
Promotes rapid and stable cooperation under low enhancement factors.
Reduces the cooperation threshold and improves coordination.
Abstract
Spatial public goods games are characterized by high-dimensional state spaces and localized externalities, which pose significant challenges for achieving stable and widespread cooperation. Traditional approaches often struggle to effectively capture neighborhood-level strategic interactions and dynamically align individual incentives with collective welfare. To resolve this issue, this paper introduces a novel intelligent decision-making framework called Local Mean-Field Proximal Policy Optimization with Unbalanced Punishment (LMFPPO-UBP). The conventional mean field concept is reformulated as a socio-statistical sensor embedded directly into the policy gradient space of deep reinforcement learning, allowing agents to adapt their strategies based on mesoscale neighborhood dynamics. Additionally, an unbalanced punishment mechanism is integrated to penalize defectors proportionally to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Reinforcement Learning in Robotics · Game Theory and Applications
