Can Machines Learn Morality? The Delphi Experiment

Liwei Jiang; Jena D. Hwang; Chandra Bhagavatula; Ronan Le Bras; Jenny; Liang; Jesse Dodge; Keisuke Sakaguchi; Maxwell Forbes; Jon Borchardt; Saadia; Gabriel; Yulia Tsvetkov; Oren Etzioni; Maarten Sap; Regina Rini; Yejin Choi

arXiv:2110.07574·cs.CL·July 14, 2022

Can Machines Learn Morality? The Delphi Experiment

Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jenny, Liang, Jesse Dodge, Keisuke Sakaguchi, Maxwell Forbes, Jon Borchardt, Saadia, Gabriel, Yulia Tsvetkov, Oren Etzioni, Maarten Sap, Regina Rini, Yejin Choi

PDF

1 Repo 1 Datasets

TL;DR

This paper introduces Delphi, a neural network framework for teaching AI systems moral reasoning based on ethical judgments, revealing both its potential and current limitations in machine ethics.

Contribution

The paper presents Delphi, a novel deep learning approach for modeling moral reasoning in AI, and analyzes its capabilities and biases in ethical decision-making.

Findings

01

Delphi generalizes well to new ethical situations

02

Neural networks show poor moral judgment without explicit training

03

Biases and inconsistencies are prevalent in current models

Abstract

As AI systems become increasingly powerful and pervasive, there are growing concerns about machines' morality or a lack thereof. Yet, teaching morality to machines is a formidable task, as morality remains among the most intensely debated questions in humanity, let alone for AI. Existing AI systems deployed to millions of users, however, are already making decisions loaded with moral implications, which poses a seemingly impossible challenge: teaching machines moral sense, while humanity continues to grapple with it. To explore this challenge, we introduce Delphi, an experimental framework based on deep neural networks trained directly to reason about descriptive ethical judgments, e.g., "helping a friend" is generally good, while "helping a friend spread fake news" is not. Empirical results shed novel insights on the promises and limits of machine ethics; Delphi demonstrates strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liweijiang/delphi
tf

Datasets

Marlon154/moral-number-corpus
dataset· 7 dl
7 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Dropout · Layer Normalization · Dense Connections · Cosine Annealing · 15 Ways to Contact How can i speak to someone at Delta Airlines