Adaptive Alignment: Dynamic Preference Adjustments via Multi-Objective   Reinforcement Learning for Pluralistic AI

Hadassah Harland; Richard Dazeley; Peter Vamplew; Hashini Senaratne,; Bahareh Nakisa; Francisco Cruz

arXiv:2410.23630·cs.LG·November 1, 2024

Adaptive Alignment: Dynamic Preference Adjustments via Multi-Objective Reinforcement Learning for Pluralistic AI

Hadassah Harland, Richard Dazeley, Peter Vamplew, Hashini Senaratne,, Bahareh Nakisa, Francisco Cruz

PDF

Open Access

TL;DR

This paper presents a dynamic method for aligning AI systems with diverse human preferences using Multi-Objective Reinforcement Learning, enabling adaptive and retroactive preference adjustments to better serve pluralistic values.

Contribution

It introduces a novel framework for post-learning preference adjustment in AI alignment through MORL, addressing the challenge of shifting human values.

Findings

01

Proposed framework effectively adjusts AI policies to changing preferences.

02

Discussed advantages of retroactive alignment in sociotechnical systems.

03

Outlined implementation details and potential benefits of the approach.

Abstract

Emerging research in Pluralistic Artificial Intelligence (AI) alignment seeks to address how intelligent systems can be designed and deployed in accordance with diverse human needs and values. We contribute to this pursuit with a dynamic approach for aligning AI with diverse and shifting user preferences through Multi Objective Reinforcement Learning (MORL), via post-learning policy selection adjustment. In this paper, we introduce the proposed framework for this approach, outline its anticipated advantages and assumptions, and discuss technical details about the implementation. We also examine the broader implications of adopting a retroactive alignment approach through the sociotechnical systems perspective.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConsumer Market Behavior and Pricing