Universal Empathy and Ethical Bias for Artificial General Intelligence

Alexey Potapov; Sergey Rodionov

arXiv:1308.0702·cs.AI·August 6, 2013

Universal Empathy and Ethical Bias for Artificial General Intelligence

Alexey Potapov, Sergey Rodionov

PDF

Open Access

TL;DR

This paper proposes an extension of the AIXI model for safe AGI that learns values through empathy and ethical bias, demonstrated with experiments in simple environments.

Contribution

It introduces a hierarchical value learning framework for AGI, emphasizing empathy and ethical bias, extending the universal intelligence model.

Findings

01

Feasibility of value learning in simple environments

02

Effective incorporation of empathy and ethical bias

03

Potential for safe AGI development

Abstract

Rational agents are usually built to maximize rewards. However, AGI agents can find undesirable ways of maximizing any prior reward function. Therefore value learning is crucial for safe AGI. We assume that generalized states of the world are valuable - not rewards themselves, and propose an extension of AIXI, in which rewards are used only to bootstrap hierarchical value learning. The modified AIXI agent is considered in the multi-agent environment, where other agents can be either humans or other "mature" agents, which values should be revealed and adopted by the "infant" AGI agent. General framework for designing such empathic agent with ethical bias is proposed also as an extension of the universal intelligence model. Moreover, we perform experiments in the simple Markov environment, which demonstrate feasibility of our approach to value learning in safe AGI.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Reinforcement Learning in Robotics