Learn to Match with No Regret: Reinforcement Learning in Markov Matching   Markets

Yifei Min; Tianhao Wang; Ruitu Xu; Zhaoran Wang; Michael I. Jordan,; Zhuoran Yang

arXiv:2203.03684·cs.LG·March 9, 2022

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Yifei Min, Tianhao Wang, Ruitu Xu, Zhaoran Wang, Michael I. Jordan,, Zhuoran Yang

PDF

Open Access 1 Video

TL;DR

This paper introduces a reinforcement learning framework for Markov matching markets, enabling a planner to optimize social welfare while agents seek stable matchings, with proven sublinear regret guarantees.

Contribution

It develops a novel RL algorithm combining optimistic value iteration with maximum weight matching for dynamic markets.

Findings

01

The algorithm achieves sublinear regret in the Markov matching setting.

02

It effectively balances exploration and stability in dynamic matching markets.

03

The framework applies to real-world scenarios like ridesharing platforms.

Abstract

We study a Markov matching market involving a planner and a set of strategic agents on the two sides of the market. At each step, the agents are presented with a dynamical context, where the contexts determine the utilities. The planner controls the transition of the contexts to maximize the cumulative social welfare, while the agents aim to find a myopic stable matching at each step. Such a setting captures a range of applications including ridesharing platforms. We formalize the problem by proposing a reinforcement learning framework that integrates optimistic value iteration with maximum weight matching. The proposed algorithm addresses the coupled challenges of sequential exploration, matching stability, and function approximation. We prove that the algorithm achieves sublinear regret.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets· slideslive

Taxonomy

TopicsTransportation and Mobility Innovations · Sharing Economy and Platforms · Organ Donation and Transplantation