Stochastic $k$-Submodular Bandits with Full Bandit Feedback

Guanyu Nie; Vaneet Aggarwal; Christopher John Quinn

arXiv:2412.10682·cs.LG·December 17, 2024

Stochastic $k$-Submodular Bandits with Full Bandit Feedback

Guanyu Nie, Vaneet Aggarwal, Christopher John Quinn

PDF

Open Access

TL;DR

This paper introduces the first sublinear regret bounds for online $k$-submodular bandit problems with full feedback, using offline-to-online transformation techniques for various function types and constraints.

Contribution

It develops online algorithms for multiple $k$-submodular bandit problems by transforming offline approximation algorithms, and analyzes their robustness in the online setting.

Findings

01

Achieved sublinear $eta$-regret bounds for different $k$-submodular bandit scenarios.

02

Extended offline-to-online framework to handle various constraints and function types.

03

Provided theoretical analysis of the robustness of offline algorithms in online contexts.

Abstract

In this paper, we present the first sublinear $α$ -regret bounds for online $k$ -submodular optimization problems with full-bandit feedback, where $α$ is a corresponding offline approximation ratio. Specifically, we propose online algorithms for multiple $k$ -submodular stochastic combinatorial multi-armed bandit problems, including (i) monotone functions and individual size constraints, (ii) monotone functions with matroid constraints, (iii) non-monotone functions with matroid constraints, (iv) non-monotone functions without constraints, and (v) monotone functions without constraints. We transform approximation algorithms for offline $k$ -submodular maximization problems into online algorithms through the offline-to-online framework proposed by Nie et al. (2023a). A key contribution of our work is analyzing the robustness of the offline algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Optimization and Search Problems