Thompson Sampling for Bandit Learning in Matching Markets

Fang Kong; Junming Yin; Shuai Li

arXiv:2204.12048·cs.LG·May 3, 2022·1 cites

Thompson Sampling for Bandit Learning in Matching Markets

Fang Kong, Junming Yin, Shuai Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces the first regret analysis of Thompson Sampling in iterative matching markets, demonstrating its practical advantages over traditional explore-then-commit and UCB algorithms through extensive experiments.

Contribution

It provides the first theoretical regret analysis of Thompson Sampling in matching markets and shows its empirical benefits over existing methods.

Findings

01

Thompson Sampling outperforms ETC and UCB algorithms in experiments.

02

The paper offers the first regret bounds for TS in matching markets.

03

TS demonstrates practical advantages in real-world applications.

Abstract

The problem of two-sided matching markets has a wide range of real-world applications and has been extensively studied in the literature. A line of recent works have focused on the problem setting where the preferences of one-side market participants are unknown \emph{a priori} and are learned by iteratively interacting with the other side of participants. All these works are based on explore-then-commit (ETC) and upper confidence bound (UCB) algorithms, two common strategies in multi-armed bandits (MAB). Thompson sampling (TS) is another popular approach, which attracts lots of attention due to its easier implementation and better empirical performances. In many problems, even when UCB and ETC-type algorithms have already been analyzed, researchers are still trying to study TS for its benefits. However, the convergence analysis of TS is much more challenging and remains open in many…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fangkongx/tsformatchingmarkets
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms

MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Linear Layer · Relative Position Encodings · InfoNCE · Residual Connection · Global-Local Attention · Layer Normalization · Contrastive Predictive Coding