Learning Individual Policies in Large Multi-agent Systems through Local   Variance Minimization

Tanvi Verma; Pradeep Varakantham

arXiv:2212.13379·cs.MA·December 29, 2022

Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization

Tanvi Verma, Pradeep Varakantham

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent reinforcement learning mechanism for large systems, minimizing variance among agents' values to improve fairness and revenue, demonstrated on taxi data and simulations.

Contribution

The paper proposes a new MARL mechanism based on variance minimization in large multi-agent systems with nearly non-atomic agents, enhancing fairness and revenue.

Findings

01

Reduces revenue variance among agents

02

Achieves higher joint revenues than existing methods

03

Effective on real-world taxi data and simulations

Abstract

In multi-agent systems with large number of agents, typically the contribution of each agent to the value of other agents is minimal (e.g., aggregation systems such as Uber, Deliveroo). In this paper, we consider such multi-agent systems where each agent is self-interested and takes a sequence of decisions and represent them as a Stochastic Non-atomic Congestion Game (SNCG). We derive key properties for equilibrium solutions in SNCG model with non-atomic and also nearly non-atomic agents. With those key equilibrium properties, we provide a novel Multi-Agent Reinforcement Learning (MARL) mechanism that minimizes variance across values of agents in the same state. To demonstrate the utility of this new mechanism, we provide detailed results on a real-world taxi dataset and also a generic simulator for aggregation systems. We show that our approach reduces the variance in revenues earned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTransportation and Mobility Innovations · Auction Theory and Applications · Transportation Planning and Optimization