Towards Empathic Deep Q-Learning
Bart Bussmann, Jacqueline Heinerman, Joel Lehman

TL;DR
This paper introduces Empathic DQN, an extension of Deep Q-Networks that incorporates empathy-inspired considerations to reduce negative side effects on other agents in multi-agent reinforcement learning environments.
Contribution
It proposes a novel empathic extension to DQNs that considers other agents' perspectives, aiming to promote ethical behavior and mitigate collateral harm.
Findings
Empathic DQN reduces negative side effects in gridworld environments.
The approach demonstrates potential for integrating machine ethics into reinforcement learning.
Initial results suggest improved alignment with ethical considerations.
Abstract
As reinforcement learning (RL) scales to solve increasingly complex tasks, interest continues to grow in the fields of AI safety and machine ethics. As a contribution to these fields, this paper introduces an extension to Deep Q-Networks (DQNs), called Empathic DQN, that is loosely inspired both by empathy and the golden rule ("Do unto others as you would have them do unto you"). Empathic DQN aims to help mitigate negative side effects to other agents resulting from myopic goal-directed behavior. We assume a setting where a learning agent coexists with other independent agents (who receive unknown rewards), where some types of reward (e.g. negative rewards from physical harm) may generalize across agents. Empathic DQN combines the typical (self-centered) value with the estimated value of other agents, by imagining (by its own standards) the value of it being in the other's situation (by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Reinforcement Learning in Robotics
MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network
