A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation   in Two-sided Markets

Chengchun Shi; Runzhe Wan; Ge Song; Shikai Luo; Rui Song; Hongtu; Zhu

arXiv:2202.10574·stat.ML·March 28, 2023·1 cites

A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets

Chengchun Shi, Runzhe Wan, Ge Song, Shikai Luo, Rui Song, Hongtu, Zhu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-agent reinforcement learning framework to evaluate policies in large-scale, spatially and temporally interconnected two-sided markets like ride-sharing, addressing high-dimensionality and interference challenges.

Contribution

The paper develops novel estimators for policy evaluation in high-dimensional, interference-prone two-sided markets using a multi-agent RL approach, validated through simulations and real data.

Findings

01

Estimators are consistent despite high dimensionality.

02

Simulation experiments show favorable performance.

03

Real data application demonstrates practical utility.

Abstract

The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multiple units in different areas receiving sequences of products (or treatments) over time. Major technical challenges, such as policy evaluation, arise in those studies because (i) spatial and temporal proximities induce interference between locations and times; and (ii) the large number of locations results in the curse of dimensionality. To address both challenges simultaneously, we introduce a multi-agent reinforcement learning (MARL) framework for carrying policy evaluation in these studies.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

runzhestat/causalmarl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTransportation and Mobility Innovations · Sharing Economy and Platforms · Transportation Planning and Optimization