Bandit Learning in Matching Markets: Utilitarian and Rawlsian   Perspectives

Hadi Hosseini; Duohan Zhang

arXiv:2412.00301·cs.LG·December 3, 2024

Bandit Learning in Matching Markets: Utilitarian and Rawlsian Perspectives

Hadi Hosseini, Duohan Zhang

PDF

Open Access

TL;DR

This paper explores learning algorithms for two-sided matching markets using bandit models, aiming to optimize welfare metrics like utilitarian and Rawlsian while ensuring market stability, with theoretical analysis and simulations.

Contribution

It introduces algorithms based on epoch Explore-Then-Commit for welfare optimization in matching markets with unknown preferences, analyzing their regret bounds.

Findings

01

Algorithms achieve sublinear regret bounds.

02

Welfare metrics improve with more learning epochs.

03

Market stability is maintained during learning.

Abstract

Two-sided matching markets have demonstrated significant impact in many real-world applications, including school choice, medical residency placement, electric vehicle charging, ride sharing, and recommender systems. However, traditional models often assume that preferences are known, which is not always the case in modern markets, where preferences are unknown and must be learned. For example, a company may not know its preference over all job applicants a priori in online markets. Recent research has modeled matching markets as multi-armed bandit (MAB) problem and primarily focused on optimizing matching for one side of the market, while often resulting in a pessimal solution for the other side. In this paper, we adopt a welfarist approach for both sides of the market, focusing on two metrics: (1) Utilitarian welfare and (2) Rawlsian welfare, while maintaining market stability. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies · Auction Theory and Applications · Game Theory and Applications

MethodsADaptive gradient method with the OPTimal convergence rate