Reinforcement learning with reputation-based adaptive exploration promotes the evolution of cooperation

An Li; Wenqiang Zhu; Chaoqian Wang; Longzhao Liu; Hongwei Zheng; Yishen Jiang; Xin Wang; Shaoting Tang

arXiv:2604.08103·physics.comp-ph·April 10, 2026

Reinforcement learning with reputation-based adaptive exploration promotes the evolution of cooperation

An Li, Wenqiang Zhu, Chaoqian Wang, Longzhao Liu, Hongwei Zheng, Yishen Jiang, Xin Wang, Shaoting Tang

PDF

TL;DR

This paper introduces a reputation-based adaptive exploration mechanism in multi-agent reinforcement learning, which promotes cooperation by adjusting exploration based on social reputation and status.

Contribution

It proposes a novel Q-learning model coupling exploration rates with reputation differences and asymmetric reputation updates, enhancing cooperation in social environments.

Findings

01

Each mechanism independently promotes cooperation.

02

Combining mechanisms yields a reinforcing effect on cooperation.

03

Reputation-based adjustments improve learning outcomes in social contexts.

Abstract

Multi-agent reinforcement learning serves as an effective tool for studying strategy adaptation in evolutionary games. Although prior work has integrated Q-learning with reputation mechanisms to promote cooperation, most existing algorithms adopt fixed exploration rates and overlook the influence of social context on exploratory behavior. In practice, individuals may adjust their willingness to explore based on their reputation and perceived social standing. To address this, we propose a Q-learning model that couples exploration rates with local reputation differences and incorporates asymmetric, state-dependent reputation updates. Our results show that each mechanism independently promotes cooperation, and their combination yields a reinforcing effect. The joint mechanism enhances cooperation by making ``high reputation--low exploration, low reputation--high exploration'', while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.