Merit-based Fair Combinatorial Semi-Bandit with Unrestricted Feedback Delays

Ziqun Chen; Kechao Cai; Zhuoyue Chen; Jinbei Zhang; John C.S. Lui

arXiv:2407.15439·cs.LG·July 30, 2024

TL;DR

This paper addresses the challenge of designing fair combinatorial semi-bandit algorithms under unrestricted feedback delays, introducing new algorithms with proven sublinear regret and demonstrating their effectiveness through experiments.

Contribution

The paper introduces algorithms for fair combinatorial semi-bandits with unrestricted feedback delays. The algorithms achieve sublinear regret bounds under both reward-independent and reward-dependent delays.

Findings

01 Algorithms achieve sublinear reward regret.

02 Algorithms achieve sublinear fairness regret.

03 Effective in synthetic and real-world scenarios.

Abstract

We study the stochastic combinatorial semi-bandit problem with unrestricted feedback delays under merit-based fairness constraints. This is motivated by applications such as crowdsourcing and online advertising, where feedback is not immediately available and fairness among different choices (or arms) is crucial. We consider two types of unrestricted feedback delays: reward-independent delays, where the feedback delays are independent of the rewards, and reward-dependent delays, where the feedback delays are correlated with the rewards. Furthermore, we introduce merit-based fairness constraints to ensure a fair selection of the arms. We define the reward regret and the fairness regret and present new bandit algorithms to select arms under unrestricted feedback delays based on their merits. We prove that our algorithms all achieve sublinear expected reward regret and expected…
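The abstract does not spell out the algorithms or the exact fairness definition, so the following Python sketch is only an illustration of the setting, not the authors' method. It assumes that "merit-based" means selecting each arm with probability proportional to a positive merit function of its estimated mean reward, and it models a delay as a whole number of rounds. All names (`merit_proportional_probs`, `DelayedFeedbackBuffer`, the two delay samplers, `max_delay`, `budget`) are hypothetical.

```python
import random


def merit_proportional_probs(mean_estimates, merit, budget=1.0):
    """Selection probabilities proportional to each arm's merit.

    Assumption: `merit` is a positive function of the estimated mean reward,
    and `budget` (expected number of arms picked per round) is small enough
    that no probability exceeds 1.
    """
    merits = [merit(mu) for mu in mean_estimates]
    total = sum(merits)
    return [budget * m / total for m in merits]


def reward_independent_delay(rng, reward, max_delay=5):
    """Delay drawn without looking at the reward."""
    return rng.randint(0, max_delay)


def reward_dependent_delay(rng, reward, max_delay=5):
    """Hypothetical correlation: low rewards are reported later."""
    return rng.randint(0, max_delay) + (max_delay if reward < 0.5 else 0)


class DelayedFeedbackBuffer:
    """Holds (arm, reward) observations until their delay has elapsed."""

    def __init__(self):
        self._pending = {}  # arrival round -> list of (arm, reward)

    def push(self, round_played, arm, reward, delay):
        arrival = round_played + delay
        self._pending.setdefault(arrival, []).append((arm, reward))

    def pop_arrived(self, current_round):
        """Return the feedback arriving exactly at `current_round`."""
        return self._pending.pop(current_round, [])
```

In a simulation loop, a learner would call `pop_arrived` at the start of each round, update its mean estimates with whatever feedback has arrived, recompute the selection probabilities, and `push` the new observations with a sampled delay.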

Taxonomy

Topics: Advanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems