Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu

TL;DR
This paper introduces FasterCUCB, a new algorithm for matroid semi-bandits that achieves sublinear per-round time complexity for common matroid classes, maintaining near-optimal regret bounds.
Contribution
It proposes FasterCUCB, a computationally efficient algorithm with sublinear time per round for matroid semi-bandits, using approximate maximum-weight basis maintenance.
Findings
Achieves $O(D ext{ polylog}(K) ext{ polylog}(T))$ time for uniform, partition, and graphical matroids.
Achieves $O(D ext{sqrt{K}} ext{ polylog}(T))$ time for transversal matroids.
Maintains regret bounds comparable to CUCB, matching the asymptotic lower bound.
Abstract
We study the matroid semi-bandits problem, where at each round the learner plays a subset of arms from a feasible set, and the goal is to maximize the expected cumulative linear rewards. Existing algorithms have per-round time complexity at least , which becomes expensive when is large. To address this computational issue, we propose FasterCUCB whose sampling rule takes time sublinear in for common classes of matroids: for uniform matroids, partition matroids, and graphical matroids, and for transversal matroids. Here, is the maximum number of elements in any feasible subset of arms, and is the horizon. Our technique is based on dynamic maintenance of an approximate maximum-weight basis over inner-product weights. Although the introduction of an approximate maximum-weight basis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research
