Value Alignment, Fair Play, and the Rights of Service Robots

Daniel Estrada

arXiv:1803.02852·cs.CY·December 7, 2018

Value Alignment, Fair Play, and the Rights of Service Robots

Daniel Estrada

PDF

TL;DR

This paper revisits Turing's concept of fair play to propose a new perspective on AI alignment, emphasizing mutual accommodation and extending rights to service robots as a means of fostering alignment of interests.

Contribution

It offers a novel interpretation of the Turing test based on fair play as a measure of value alignment and discusses extending robot rights to promote human-machine interest alignment.

Findings

01

Fair play interpretation links Turing test to value alignment

02

Extending robot rights can promote mutual interest alignment

03

Reframes AI safety discussions within a fairness and rights context

Abstract

Ethics and safety research in artificial intelligence is increasingly framed in terms of "alignment" with human values and interests. I argue that Turing's call for "fair play for machines" is an early and often overlooked contribution to the alignment literature. Turing's appeal to fair play suggests a need to correct human behavior to accommodate our machines, a surprising inversion of how value alignment is treated today. Reflections on "fair play" motivate a novel interpretation of Turing's notorious "imitation game" as a condition not of intelligence but instead of value alignment: a machine demonstrates a minimal degree of alignment (with the norms of conversation, for instance) when it can go undetected when interrogated by a human. I carefully distinguish this interpretation from the Moral Turing Test, which is not motivated by a principle of fair play, but instead depends on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.