LMFPPO-UBP: Local Mean Field Proximal Policy Optimization with Unbalanced Punishment for Spatial Public Goods Games

Jinshuo Yang; Zhaoqilin Yang; Wenjie Zhou; Xin Wang; Youliang Tian

arXiv:2602.18696·cs.GT·February 24, 2026

LMFPPO-UBP: Local Mean Field Proximal Policy Optimization with Unbalanced Punishment for Spatial Public Goods Games

Jinshuo Yang, Zhaoqilin Yang, Wenjie Zhou, Xin Wang, Youliang Tian

PDF

Open Access

TL;DR

This paper introduces LMFPPO-UBP, a novel reinforcement learning framework that enhances cooperation in spatial public goods games by incorporating local mean-field dynamics and unbalanced punishment mechanisms.

Contribution

It reformulates the mean field as a socio-statistical sensor within policy gradients and integrates unbalanced punishment to effectively promote cooperation.

Findings

01

Outperforms baseline methods like Q-learning and Fermi update rules.

02

Promotes rapid and stable cooperation under low enhancement factors.

03

Reduces the cooperation threshold and improves coordination.

Abstract

Spatial public goods games are characterized by high-dimensional state spaces and localized externalities, which pose significant challenges for achieving stable and widespread cooperation. Traditional approaches often struggle to effectively capture neighborhood-level strategic interactions and dynamically align individual incentives with collective welfare. To resolve this issue, this paper introduces a novel intelligent decision-making framework called Local Mean-Field Proximal Policy Optimization with Unbalanced Punishment (LMFPPO-UBP). The conventional mean field concept is reformulated as a socio-statistical sensor embedded directly into the policy gradient space of deep reinforcement learning, allowing agents to adapt their strategies based on mesoscale neighborhood dynamics. Additionally, an unbalanced punishment mechanism is integrated to penalize defectors proportionally to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Game Theory and Cooperation · Reinforcement Learning in Robotics · Game Theory and Applications