Kindness in Multi-Agent Reinforcement Learning

Farinaz Alamiyan-Harandi; Mersad Hassanjani; Pouria Ramazi

arXiv:2311.04239·cs.AI·November 9, 2023·1 cites

Kindness in Multi-Agent Reinforcement Learning

Farinaz Alamiyan-Harandi, Mersad Hassanjani, Pouria Ramazi

PDF

Open Access

TL;DR

This paper introduces KindMARL, a novel multi-agent reinforcement learning method that incorporates fairness and kindness by evaluating agents' intentions through counterfactual reasoning, leading to improved cooperation and higher rewards.

Contribution

The paper proposes a new approach to MARL that models kindness via intention measurement using counterfactual analysis, enhancing cooperative behavior among agents.

Findings

01

Agents trained with KindMARL outperform existing methods in reward accumulation.

02

KindMARL significantly improves cooperation in environmental tasks.

03

Experimental results show increased total rewards in multiple environments.

Abstract

In human societies, people often incorporate fairness in their decisions and treat reciprocally by being kind to those who act kindly. They evaluate the kindness of others' actions not only by monitoring the outcomes but also by considering the intentions. This behavioral concept can be adapted to train cooperative agents in Multi-Agent Reinforcement Learning (MARL). We propose the KindMARL method, where agents' intentions are measured by counterfactual reasoning over the environmental impact of the actions that were available to the agents. More specifically, the current environment state is compared with the estimation of the current environment state provided that the agent had chosen another action. The difference between each agent's reward, as the outcome of its action, with that of its fellow, multiplied by the intention of the fellow is then taken as the fellow's "kindness". If…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies