Online Continuous Submodular Maximization: From Full-Information to Bandit Feedback
Mingrui Zhang, Lin Chen, Hamed Hassani, Amin Karbasi

TL;DR
This paper introduces three novel online algorithms for continuous and discrete submodular maximization, reducing gradient evaluations and achieving new regret bounds in full-information and bandit feedback settings.
Contribution
The paper presents the first bandit algorithm for continuous DR-submodular maximization and extends it to discrete cases, improving efficiency and regret bounds.
Findings
Mono-Frank-Wolfe reduces gradient evaluations to 1 per function.
Bandit-Frank-Wolfe is the first bandit algorithm for continuous DR-submodular maximization.
Responsive-Frank-Wolfe achieves regret bounds in the discrete bandit setting.
Abstract
In this paper, we propose three online algorithms for submodular maximisation. The first one, Mono-Frank-Wolfe, reduces the number of per-function gradient evaluations from [Chen2018Online] and [chen2018projection] to 1, and achieves a -regret bound of . The second one, Bandit-Frank-Wolfe, is the first bandit algorithm for continuous DR-submodular maximization, which achieves a -regret bound of . Finally, we extend Bandit-Frank-Wolfe to a bandit algorithm for discrete submodular maximization, Responsive-Frank-Wolfe, which attains a -regret bound of in the responsive bandit setting.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Cryptography and Data Security · Internet Traffic Analysis and Secure E-voting
