Value Alignment, Fair Play, and the Rights of Service Robots
Daniel Estrada

TL;DR
This paper revisits Turing's concept of fair play to propose a new perspective on AI alignment, emphasizing mutual accommodation and extending rights to service robots as a means of fostering alignment of interests.
Contribution
It offers a novel interpretation of the Turing test based on fair play as a measure of value alignment and discusses extending robot rights to promote human-machine interest alignment.
Findings
Fair play interpretation links Turing test to value alignment
Extending robot rights can promote mutual interest alignment
Reframes AI safety discussions within a fairness and rights context
Abstract
Ethics and safety research in artificial intelligence is increasingly framed in terms of "alignment" with human values and interests. I argue that Turing's call for "fair play for machines" is an early and often overlooked contribution to the alignment literature. Turing's appeal to fair play suggests a need to correct human behavior to accommodate our machines, a surprising inversion of how value alignment is treated today. Reflections on "fair play" motivate a novel interpretation of Turing's notorious "imitation game" as a condition not of intelligence but instead of value alignment: a machine demonstrates a minimal degree of alignment (with the norms of conversation, for instance) when it can go undetected when interrogated by a human. I carefully distinguish this interpretation from the Moral Turing Test, which is not motivated by a principle of fair play, but instead depends on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
