Combinatorial Multi-armed Bandits: Arm Selection via Group Testing

Arpan Mukherjee; Shashanka Ubaru; Keerthiram Murugesan; Karthikeyan Shanmugam; Ali Tajer

arXiv:2410.10679·cs.LG·August 14, 2025

Combinatorial Multi-armed Bandits: Arm Selection via Group Testing

Arpan Mukherjee, Shashanka Ubaru, Keerthiram Murugesan, Karthikeyan Shanmugam, Ali Tajer

PDF

Open Access

TL;DR

This paper introduces a new algorithm for combinatorial multi-armed bandits that significantly reduces computational complexity by replacing the exact oracle with group testing and quantized Thompson sampling, maintaining optimal regret.

Contribution

The paper presents a novel approach combining group testing and quantized Thompson sampling to efficiently select super-arms with reduced complexity.

Findings

01

Reduces super-arm selection complexity to logarithmic in the number of arms.

02

Achieves the same regret bounds as state-of-the-art algorithms with exact oracles.

03

Provides an exponential reduction in computational complexity.

Abstract

This paper considers the problem of combinatorial multi-armed bandits with semi-bandit feedback and a cardinality constraint on the super-arm size. Existing algorithms for solving this problem typically involve two key sub-routines: (1) a parameter estimation routine that sequentially estimates a set of base-arm parameters, and (2) a super-arm selection policy for selecting a subset of base arms deemed optimal based on these parameters. State-of-the-art algorithms assume access to an exact oracle for super-arm selection with unbounded computational power. At each instance, this oracle evaluates a list of score functions, the number of which grows as low as linearly and as high as exponentially with the number of arms. This can be prohibitive in the regime of a large number of arms. This paper introduces a novel realistic alternative to the perfect oracle. This algorithm uses a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Spam and Phishing Detection · Optimization and Search Problems

MethodsBalanced Selection · Sparse Evolutionary Training