Achieving Equilibrium under Utility Heterogeneity: An Agent-Attention Framework for Multi-Agent Multi-Objective Reinforcement Learning

Zhuhui Li; Chunbo Luo; Liming Huang; Luyu Qi; Geyong Min

arXiv:2511.08926·cs.MA·November 13, 2025

Achieving Equilibrium under Utility Heterogeneity: An Agent-Attention Framework for Multi-Agent Multi-Objective Reinforcement Learning

Zhuhui Li, Chunbo Luo, Liming Huang, Luyu Qi, Geyong Min

PDF

Open Access 1 Video

TL;DR

This paper introduces AA-MAMORL, a novel framework enabling decentralized agents in multi-objective systems to learn policies that approximate Bayesian Nash Equilibrium by implicitly modeling global utilities during training.

Contribution

It provides a theoretical proof of the necessity of global utility modeling for equilibrium and proposes a method that learns this implicitly during centralized training.

Findings

01

AA-MAMORL outperforms existing methods in experiments.

02

Access to global preferences improves decision-making.

03

Agents can act independently without communication during execution.

Abstract

Multi-agent multi-objective systems (MAMOS) have emerged as powerful frameworks for modelling complex decision-making problems across various real-world domains, such as robotic exploration, autonomous traffic management, and sensor network optimisation. MAMOS offers enhanced scalability and robustness through decentralised control and more accurately reflects inherent trade-offs between conflicting objectives. In MAMOS, each agent uses utility functions that map return vectors to scalar values. Existing MAMOS optimisation methods face challenges in handling heterogeneous objective and utility function settings, where training non-stationarity is intensified due to private utility functions and the associated policies. In this paper, we first theoretically prove that direct access to, or structured modeling of, global utility functions is necessary for the Bayesian Nash Equilibrium…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Achieving Equilibrium Under Utility Heterogeneity: An Agent-Attention Framework for Multi-Agent Multi-Objective Reinforcement Learning· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Advanced Multi-Objective Optimization Algorithms