# Incentivising cooperation by judging a group’s performance by its weakest member in neuroevolution and reinforcement learning

**Authors:** Jory Schossau, Bamshad Shirmohammadi, Arend Hintze

PMC · DOI: 10.3389/frobt.2025.1599676 · Frontiers in Robotics and AI · 2025-07-25

## TL;DR

This paper introduces a reward system for autonomous agents that encourages cooperation by basing group rewards on the weakest member's performance.

## Contribution

The novel contribution is using a weakest-member reward metric to promote equitable and cooperative behavior in multi-agent systems.

## Key findings

- The weakest-member reward system promotes equitable behavior among autonomous agents.
- Equitably-minded agents achieve greater stability and higher individual outcomes than selfish agents.
- The approach aligns with biological mechanisms like group-level selection and inclusive fitness theory.

## Abstract

Autonomous agents increasingly interact within social domains such as customer service, transportation, and healthcare, often acting collectively on behalf of humans. In many of these scenarios, individually greedy strategies can diminish overall performance, exemplified by phenomena such as stop-and-go traffic congestion or network service disruptions due to competing interests. Thus, there is a growing need to develop decision-making strategies for autonomous agents that balance individual efficiency with group equitability.

We propose a straightforward approach for rewarding groups of autonomous agents within evolutionary and reinforcement learning frameworks based explicitly on the performance of the weakest member of the group. Rather than optimizing each agent's individual rewards independently, we align incentives by using a “weakest-link” metric, thereby encouraging collective strategies that support equitable outcomes.

Our results demonstrate that this weakest-member reward system effectively promotes equitable behavior among autonomous agents. Agents evolve or learn to balance collective benefit with individual performance, resulting in fairer outcomes for the entire group. Notably, the introduced approach improves overall efficiency, as equitably-minded agents collectively achieve greater stability and higher individual outcomes than agents pursuing purely selfish strategies.

This methodology aligns closely with biological mechanisms observed in nature, specifically group-level selection and inclusive fitness theory. By tying the evolutionary and learning objectives to the group’s weakest member, we mimic natural processes that favor cooperative and equitable behaviors. Our findings highlight the importance of incentive structures that consider the collective well-being to optimize both group fairness and individual agent success. Future research should explore how this reward framework generalizes across broader domains and more complex agent interactions.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12331510/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12331510/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12331510/full.md

---
Source: https://tomesphere.com/paper/PMC12331510