A computational framework for human values

Nardine Osman; Mark d'Inverno

arXiv:2305.02748·cs.AI·February 9, 2026·5 cites

A computational framework for human values

Nardine Osman, Mark d'Inverno

PDF

Open Access

TL;DR

This paper introduces a formal computational framework based on social sciences to systematically understand and incorporate human values into ethical AI design, addressing a key gap in value alignment research.

Contribution

It proposes the first formal, computational definition of human values rooted in social sciences, enabling interdisciplinary investigation for ethical AI development.

Findings

01

Provides a foundational formal framework for human values

02

Facilitates systematic investigation into value-based AI ethics

03

Supports development of AI aligned with human values

Abstract

In the diverse array of work investigating the nature of human values from psychology, philosophy and social sciences, there is a clear consensus that values guide behaviour. More recently, a recognition that values provide a means to engineer ethical AI has emerged. Indeed, Stuart Russell proposed shifting AI's focus away from simply ``intelligence'' towards intelligence ``provably aligned with human values''. This challenge -- the value alignment problem -- with others including an AI's learning of human values, aggregating individual values to groups, and designing computational mechanisms to reason over values, has energised a sustained research effort. Despite this, no formal, computational definition of values has yet been proposed. We address this through a formal conceptual framework rooted in the social sciences, that provides a foundation for the systematic, integrated and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPsychology of Moral and Emotional Judgment · Ethics and Social Impacts of AI · Social and Intergroup Psychology