Foundational Moral Values for AI Alignment

Betty Li Hou; Brian Patrick Green

arXiv:2311.17017 · cs.CY · November 29, 2023 · 2 cites

TL;DR

This paper proposes five foundational values drawn from moral philosophy (survival, sustainable intergenerational existence, society, education, and truth) as a framework to guide AI alignment efforts and to identify the threats and opportunities that AI systems pose to those values.

Contribution

It introduces a philosophically grounded set of core values to improve the clarity and robustness of AI alignment targets.

Findings

01 Provides a structured framework for AI alignment based on moral philosophy.

02 Highlights how AI systems can threaten or support these core values.

03 Offers a basis for future technical and ethical AI development.

Abstract

Solving the AI alignment problem requires having clear, defensible values towards which AI systems can align. Currently, targets for alignment remain underspecified and do not seem to be built from a philosophically robust structure. We begin the discussion of this problem by presenting five core, foundational values, drawn from moral philosophy and built on the requisites for human existence: survival, sustainable intergenerational existence, society, education, and truth. We show that these values not only provide a clearer direction for technical alignment work, but also serve as a framework to highlight threats and opportunities from AI systems to both obtain and sustain these values.

Datasets

jpamarlphi-byte/Draft_AI_AGI_ASI_Universal_Ethical_Charter
dataset · 13 dl

Taxonomy

Topics: Ethics and Social Impacts of AI · Scientific Computing and Data Management · Explainable Artificial Intelligence (XAI)