Learning Altruistic Behaviours in Reinforcement Learning without External Rewards
Tim Franzmeyer, Mateusz Malinowski, Jo\~ao F. Henriques

TL;DR
This paper introduces a novel reinforcement learning approach where agents learn altruistic behaviors by increasing other agents' options without external rewards, enabling task-agnostic cooperation.
Contribution
It formalizes a new altruistic learning framework based on maximizing other agents' future choices, without relying on explicit goal knowledge or external rewards.
Findings
Agents perform comparably to explicitly cooperative agents.
Altruistic behavior improves success in multi-agent environments.
Unsupervised altruistic agents can outperform supervised counterparts.
Abstract
Can artificial agents learn to assist others in achieving their goals without knowing what those goals are? Generic reinforcement learning agents could be trained to behave altruistically towards others by rewarding them for altruistic behaviour, i.e., rewarding them for benefiting other agents in a given situation. Such an approach assumes that other agents' goals are known so that the altruistic agent can cooperate in achieving those goals. However, explicit knowledge of other agents' goals is often difficult to acquire. In the case of human agents, their goals and preferences may be difficult to express fully; they might be ambiguous or even contradictory. Thus, it is beneficial to develop agents that do not depend on external supervision and learn altruistic behaviour in a task-agnostic manner. We propose to act altruistically towards other agents by giving them more choice and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
