Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance

Dimitris Michailidis; Willem R\"opke; Diederik M. Roijers; Sennay Ghebreab; Fernando P. Santos

arXiv:2411.18195·cs.LG·March 27, 2026

Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance

Dimitris Michailidis, Willem R\"opke, Diederik M. Roijers, Sennay Ghebreab, Fernando P. Santos

PDF

1 Repo

TL;DR

This paper presents a scalable multi-objective reinforcement learning algorithm that incorporates fairness using Lorenz dominance, enabling equitable policy discovery in complex, many-objective environments like large-scale transport planning.

Contribution

It introduces a novel Lorenz dominance-based approach with lambda-Lorenz dominance for flexible fairness, improving scalability and fairness in many-objective reinforcement learning.

Findings

01

Outperforms existing methods in high-dimensional objective spaces

02

Encourages discovery of fair policies in large-scale environments

03

Demonstrates scalability in real-world transport planning scenarios

Abstract

Multi-Objective Reinforcement Learning (MORL) aims to learn a set of policies that optimize trade-offs between multiple, often conflicting objectives. MORL is computationally more complex than single-objective RL, particularly as the number of objectives increases. Additionally, when objectives involve the preferences of agents or groups, incorporating fairness becomes both important and socially desirable. This paper introduces a principled algorithm that incorporates fairness into MORL while improving scalability to many-objective problems. We propose using Lorenz dominance to identify policies with equitable reward distributions and introduce lambda-Lorenz dominance to enable flexible fairness preferences. We release a new, large-scale real-world transport planning environment and demonstrate that our method encourages the discovery of fair policies, showing improved scalability in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dimichai/mo-tndp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training