Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment

Peter Vamplew; Conor F Hayes; Cameron Foale; Richard Dazeley; Hadassah; Harland

arXiv:2410.11221·cs.LG·October 16, 2024

Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment

Peter Vamplew, Conor F Hayes, Cameron Foale, Richard Dazeley, Hadassah, Harland

PDF

Open Access

TL;DR

This paper discusses how multi-objective reinforcement learning (MORL) can help develop AI systems aligned with multiple conflicting values, addressing limitations of traditional scalar reward-based RL.

Contribution

It provides an overview of MORL's potential role in creating pluralistically-aligned AI systems, highlighting its advantages over scalar RL.

Findings

01

MORL uses vector rewards to handle multiple conflicting objectives.

02

MORL offers a promising approach for aligning AI with diverse stakeholder values.

03

The paper emphasizes MORL's importance in pluralistic AI alignment.

Abstract

Reinforcement learning (RL) is a valuable tool for the creation of AI systems. However it may be problematic to adequately align RL based on scalar rewards if there are multiple conflicting values or stakeholders to be considered. Over the last decade multi-objective reinforcement learning (MORL) using vector rewards has emerged as an alternative to standard, scalar RL. This paper provides an overview of the role which MORL can play in creating pluralistically-aligned AI.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpen Source Software Innovations · Digital Platforms and Economics

MethodsALIGN