Learning Altruistic Behaviours in Reinforcement Learning without   External Rewards

Tim Franzmeyer; Mateusz Malinowski; Jo\~ao F. Henriques

arXiv:2107.09598·cs.AI·March 22, 2022·1 cites

Learning Altruistic Behaviours in Reinforcement Learning without External Rewards

Tim Franzmeyer, Mateusz Malinowski, Jo\~ao F. Henriques

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel reinforcement learning approach where agents learn altruistic behaviors by increasing other agents' options without external rewards, enabling task-agnostic cooperation.

Contribution

It formalizes a new altruistic learning framework based on maximizing other agents' future choices, without relying on explicit goal knowledge or external rewards.

Findings

01

Agents perform comparably to explicitly cooperative agents.

02

Altruistic behavior improves success in multi-agent environments.

03

Unsupervised altruistic agents can outperform supervised counterparts.

Abstract

Can artificial agents learn to assist others in achieving their goals without knowing what those goals are? Generic reinforcement learning agents could be trained to behave altruistically towards others by rewarding them for altruistic behaviour, i.e., rewarding them for benefiting other agents in a given situation. Such an approach assumes that other agents' goals are known so that the altruistic agent can cooperate in achieving those goals. However, explicit knowledge of other agents' goals is often difficult to acquire. In the case of human agents, their goals and preferences may be difficult to express fully; they might be ambiguous or even contradictory. Thus, it is beneficial to develop agents that do not depend on external supervision and learn altruistic behaviour in a task-agnostic manner. We propose to act altruistically towards other agents by giving them more choice and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Altruistic Behaviours in Reinforcement Learning without External Rewards· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques