Online Learning for Non-monotone Submodular Maximization: From Full Information to Bandit Feedback
Qixin Zhang, Zengde Deng, Zaiyi Chen, Kuangqi Zhou, Haoyuan Hu, Yu, Yang

TL;DR
This paper introduces new algorithms for online non-monotone submodular maximization over convex sets, achieving sublinear regret bounds in full information, one-shot, and bandit feedback settings, with practical efficiency and empirical validation.
Contribution
It presents the Meta-MFW, Mono-MFW, and Bandit-MFW algorithms, the first to achieve sublinear regret in various feedback models for this problem, improving efficiency and applicability.
Findings
Meta-MFW achieves $1/e$-regret of $O(\sqrt{T})$ with $T^{3/2}$ gradient evaluations.
Mono-MFW reduces evaluations to 1 per function with $O(T^{4/5})$ regret.
Bandit-MFW attains $O(T^{8/9})$ regret in the bandit setting.
Abstract
In this paper, we revisit the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set, which finds wide real-world applications in the domain of machine learning, economics, and operations research. At first, we present the Meta-MFW algorithm achieving a -regret of at the cost of stochastic gradient evaluations per round. As far as we know, Meta-MFW is the first algorithm to obtain -regret of for the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set. Furthermore, in sharp contrast with ODC algorithm \citep{thang2021online}, Meta-MFW relies on the simple online linear oracle without discretization, lifting, or rounding operations. Considering the practical restrictions, we then propose the Mono-MFW algorithm, which reduces the per-function stochastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques
