A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets
Chengchun Shi, Runzhe Wan, Ge Song, Shikai Luo, Rui Song, Hongtu, Zhu

TL;DR
This paper introduces a multi-agent reinforcement learning framework to evaluate policies in large-scale, spatially and temporally interconnected two-sided markets like ride-sharing, addressing high-dimensionality and interference challenges.
Contribution
The paper develops novel estimators for policy evaluation in high-dimensional, interference-prone two-sided markets using a multi-agent RL approach, validated through simulations and real data.
Findings
Estimators are consistent despite high dimensionality.
Simulation experiments show favorable performance.
Real data application demonstrates practical utility.
Abstract
The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multiple units in different areas receiving sequences of products (or treatments) over time. Major technical challenges, such as policy evaluation, arise in those studies because (i) spatial and temporal proximities induce interference between locations and times; and (ii) the large number of locations results in the curse of dimensionality. To address both challenges simultaneously, we introduce a multi-agent reinforcement learning (MARL) framework for carrying policy evaluation in these studies.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Sharing Economy and Platforms · Transportation Planning and Optimization
