Matroid Semi-Bandits in Sublinear Time

Ruo-Chun Tzeng; Naoto Ohsaka; Kaito Ariu

arXiv:2405.17968 · cs.LG · May 29, 2024

TL;DR

This paper introduces FasterCUCB, an algorithm for matroid semi-bandits whose per-round time is sublinear in the number of arms $K$ for common matroid classes, while keeping regret bounds close to those of CUCB.

Contribution

It proposes FasterCUCB, a computationally efficient algorithm with sublinear time per round for matroid semi-bandits, using approximate maximum-weight basis maintenance.

Findings

01

Achieves $O(D\,\text{polylog}(K)\,\text{polylog}(T))$ time per round for uniform, partition, and graphical matroids.

02

Achieves $O(D\sqrt{K}\,\text{polylog}(T))$ time per round for transversal matroids.

03

Maintains regret bounds comparable to CUCB, matching the asymptotic lower bound.
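
For reference, the regret in this setting is usually defined as below. The notation is an assumption added here and does not come from this page: $\mu_i$ is the mean reward of arm $i$, $\mathcal{B}$ is the set of feasible subsets (bases of the matroid), and $B_t$ is the subset played at round $t$.

$$R(T) = T \max_{B \in \mathcal{B}} \sum_{i \in B} \mu_i \;-\; \mathbb{E}\left[\sum_{t=1}^{T} \sum_{i \in B_t} \mu_i\right]$$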

Abstract

We study the matroid semi-bandits problem, where at each round the learner plays a subset of $K$ arms from a feasible set, and the goal is to maximize the expected cumulative linear rewards. Existing algorithms have per-round time complexity at least $\Omega(K)$, which becomes expensive when $K$ is large. To address this computational issue, we propose FasterCUCB, whose sampling rule takes time sublinear in $K$ for common classes of matroids: $O(D\,\text{polylog}(K)\,\text{polylog}(T))$ for uniform matroids, partition matroids, and graphical matroids, and $O(D\sqrt{K}\,\text{polylog}(T))$ for transversal matroids. Here, $D$ is the maximum number of elements in any feasible subset of arms, and $T$ is the horizon. Our technique is based on dynamic maintenance of an approximate maximum-weight basis over inner-product weights. Although the introduction of an approximate maximum-weight basis…
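
To make the $\Omega(K)$ baseline concrete, here is a minimal sketch of one round of a plain CUCB-style sampling rule: compute an optimistic index for every arm, then build a maximum-weight basis greedily. This is not FasterCUCB, and the function names, the confidence radius $\sqrt{1.5 \ln t / n}$, and the independence-oracle interface are assumptions chosen for illustration.

```python
import math
from typing import Callable, Dict, List, Sequence

# Hypothetical helper names; this illustrates the exact greedy baseline,
# not the approximate dynamic maintenance used by FasterCUCB.

def ucb_indices(means: Sequence[float], counts: Sequence[int], t: int) -> List[float]:
    """Optimistic index per arm: empirical mean plus a confidence radius.

    Assumes every arm has been pulled at least once (counts[i] >= 1).
    """
    return [m + math.sqrt(1.5 * math.log(t) / n) for m, n in zip(means, counts)]

def greedy_max_weight_basis(
    weights: Sequence[float],
    can_add: Callable[[List[int], int], bool],
) -> List[int]:
    """Scan arms in decreasing weight; keep an arm if the set stays independent."""
    order = sorted(range(len(weights)), key=lambda a: weights[a], reverse=True)
    basis: List[int] = []
    for arm in order:
        if can_add(basis, arm):
            basis.append(arm)
    return basis

def uniform_oracle(d: int) -> Callable[[List[int], int], bool]:
    """Uniform matroid: any set of at most d arms is independent."""
    return lambda basis, arm: len(basis) < d

def partition_oracle(
    groups: Sequence[int], capacity: Dict[int, int]
) -> Callable[[List[int], int], bool]:
    """Partition matroid: at most capacity[g] arms may be taken from group g."""
    def can_add(basis: List[int], arm: int) -> bool:
        used = sum(1 for b in basis if groups[b] == groups[arm])
        return used < capacity[groups[arm]]
    return can_add

if __name__ == "__main__":
    means = [0.2, 0.9, 0.5, 0.7]
    counts = [10, 10, 10, 10]
    indices = ucb_indices(means, counts, t=40)
    print(greedy_max_weight_basis(indices, uniform_oracle(2)))
```

Sorting all indices costs $O(K \log K)$ per round, which is the kind of per-round cost linear in $K$ that the abstract sets out to avoid.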

Taxonomy

Topics: Advanced Bandit Algorithms Research