Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning

Andr\'es Holgado-S\'anchez; Peter Vamplew; Richard Dazeley; Sascha Ossowski; Holger Billhardt

arXiv:2602.08835·cs.AI·February 12, 2026

Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning

Andr\'es Holgado-S\'anchez, Peter Vamplew, Richard Dazeley, Sascha Ossowski, Holger Billhardt

PDF

Open Access

TL;DR

This paper introduces algorithms that learn societal value systems and aligned policies in multi-agent MDPs using clustering and preference-based multi-objective reinforcement learning, enabling better understanding of diverse human preferences.

Contribution

It presents a novel approach combining clustering and PbMORL to model and learn multiple societal value systems and corresponding policies in a unified framework.

Findings

01

Outperforms state-of-the-art PbMORL algorithms on value alignment tasks.

02

Effectively models diverse user preferences through clustering.

03

Produces approximately Pareto-optimal policies aligned with learned value systems.

Abstract

Value-aware AI should recognise human values and adapt to the value systems (value-based preferences) of different users. This requires operationalization of values, which can be prone to misspecification. The social nature of values demands their representation to adhere to multiple users while value systems are diverse, yet exhibit patterns among groups. In sequential decision making, efforts have been made towards personalization for different goals or values from demonstrations of diverse agents. However, these approaches demand manually designed features or lack value-based interpretability and/or adaptability to diverse user preferences. We propose algorithms for learning models of value alignment and value systems for a society of agents in Markov Decision Processes (MDPs), based on clustering and preference-based multi-objective reinforcement learning (PbMORL). We jointly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing