Queueing Matching Bandits with Preference Feedback

Jung-hun Kim; Min-hwan Oh

arXiv:2410.10098 · stat.ML · May 7, 2025


TL;DR

This paper introduces algorithms for queueing systems with preference-based server-job matching, balancing system stability and learning unknown service rates, with proven bounds and experimental validation.

Contribution

It proposes UCB and Thompson Sampling algorithms for queueing matching with preference feedback, achieving stability and sublinear regret bounds.

Findings

01 The algorithms stabilize the queues, with a bounded average queue length.

02 The regret bounds are sublinear in the time horizon.

03 Experimental results support the theoretical guarantees.

Abstract

In this study, we consider multi-class multi-server asymmetric queueing systems consisting of $N$ queues on one side and $K$ servers on the other side, where jobs randomly arrive in queues at each time. The service rate of each job-server assignment is unknown and modeled by a feature-based multinomial logit (MNL) function. At each time, a scheduler assigns jobs to servers, and each server stochastically serves at most one job based on its preferences over the assigned jobs. The primary goal of the algorithm is to stabilize the queues in the system while learning the service rates of servers. To achieve this goal, we propose algorithms based on UCB and Thompson Sampling, which achieve system stability with an average queue length bound of $O(\min\{N, K\}/\epsilon)$ for a large time horizon $T$, where $\epsilon$ is the traffic slackness of the system. Furthermore, the algorithms achieve…
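The service model described in the abstract can be illustrated with a small sketch. It assumes the common MNL form with a "no service" outside option, where a server with parameter vector theta serves assigned job n with probability exp(x_n·theta) / (1 + sum over assigned jobs m of exp(x_m·theta)). The function names, feature layout, and the outside-option convention are illustrative assumptions, not taken from the paper's code.

```python
import numpy as np


def mnl_service_probs(features, theta):
    """Service probabilities for one server under an assumed MNL model.

    features: (m, d) array, one feature vector per job assigned to the server.
    theta:    (d,) array, the server's (in practice unknown) preference parameter.
    Returns (probs, p_none): per-job service probabilities and the
    probability that the server serves no job in this round.
    """
    features = np.asarray(features, dtype=float)
    utilities = features @ np.asarray(theta, dtype=float)
    # Shift utilities for numerical stability; the outside option has utility 0.
    shift = max(0.0, float(utilities.max())) if utilities.size else 0.0
    weights = np.exp(utilities - shift)
    none_weight = np.exp(-shift)
    total = none_weight + weights.sum()
    return weights / total, none_weight / total


def sample_served_job(features, theta, rng):
    """Sample the index of the served job, or None if no job is served."""
    probs, p_none = mnl_service_probs(features, theta)
    outcome = rng.choice(len(probs) + 1, p=np.append(probs, p_none))
    return None if outcome == len(probs) else int(outcome)


# Hypothetical example: two jobs with 2-dimensional features on one server.
rng = np.random.default_rng(0)
example_features = np.array([[1.0, 0.0], [0.0, 1.0]])
example_theta = np.array([0.5, -0.5])
example_probs, example_p_none = mnl_service_probs(example_features, example_theta)
served = sample_served_job(example_features, example_theta, rng)
```

Because each server serves at most one job per round, the per-job probabilities and the no-service probability sum to one. A learning algorithm such as the UCB or Thompson Sampling variants mentioned above would replace the true theta with an estimate built from observed service outcomes.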


Code & Models

Repositories

junghunkim7786/queueing-matching-bandits-with-preference-feedback (Official)

Videos

Queueing Matching Bandits with Preference Feedback · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Smart Grid Energy Management · Auction Theory and Applications
