Online Learning for Non-monotone Submodular Maximization: From Full Information to Bandit Feedback

Qixin Zhang; Zengde Deng; Zaiyi Chen; Kuangqi Zhou; Haoyuan Hu; Yu Yang

arXiv:2208.07632·cs.LG·August 17, 2022·1 cites

Open Access

TL;DR

This paper introduces new algorithms for online non-monotone submodular maximization over convex sets, achieving sublinear regret bounds in full information, one-shot, and bandit feedback settings, with practical efficiency and empirical validation.

Contribution

The paper presents three algorithms, Meta-MFW, Mono-MFW, and Bandit-MFW, which it describes as the first to achieve sublinear regret for this problem under full-information, one-shot, and bandit feedback, respectively.

Findings

01

Meta-MFW achieves a $1/e$-regret of $O(\sqrt{T})$ at the cost of $T^{3/2}$ stochastic gradient evaluations per round.

02

Mono-MFW reduces the number of stochastic gradient evaluations to one per function, with a regret bound of $O(T^{4/5})$.

03

Bandit-MFW attains $O(T^{8/9})$ regret in the bandit setting.
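
For readers unfamiliar with the term, the $1/e$-regret used in the findings above is usually defined as follows. This is the standard definition of approximate regret, written here as a reminder rather than quoted from the paper; the symbols $\mathcal{C}$ (the down-closed convex feasible set), $f_t$ (the function revealed in round $t$), and $x_t$ (the point played in round $t$) are notation assumed for this sketch.

$$
\mathcal{R}_{1/e}(T) \;=\; \frac{1}{e}\,\max_{x \in \mathcal{C}} \sum_{t=1}^{T} f_t(x) \;-\; \sum_{t=1}^{T} f_t(x_t)
$$

A bound such as $O(\sqrt{T})$ on this quantity means that the average per-round gap to $1/e$ times the best fixed point in hindsight vanishes as $T$ grows.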

Abstract

In this paper, we revisit the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set, which finds wide real-world applications in the domains of machine learning, economics, and operations research. First, we present the Meta-MFW algorithm, which achieves a $1/e$-regret of $O(\sqrt{T})$ at the cost of $T^{3/2}$ stochastic gradient evaluations per round. As far as we know, Meta-MFW is the first algorithm to obtain a $1/e$-regret of $O(\sqrt{T})$ for the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set. Furthermore, in sharp contrast with the ODC algorithm (Thang and Srivastav, 2021), Meta-MFW relies on a simple online linear oracle without discretization, lifting, or rounding operations. Considering the practical restrictions, we then propose the Mono-MFW algorithm, which reduces the per-function stochastic…
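
To make the "linear oracle without discretization, lifting, or rounding" idea concrete, here is a minimal offline sketch. It assumes that "MFW" refers to a measured Frank-Wolfe style update, in which each step moves along `v * (1 - x)` for a direction `v` returned by a linear oracle; the page does not spell this out. It is not the paper's online Meta-MFW algorithm. The quadratic objective, the budget constraint, and all function names below are hypothetical choices made for illustration.

```python
# Offline measured Frank-Wolfe sketch (illustrative assumption, not the paper's algorithm).
# Objective (hypothetical): F(x) = h.x + 0.5 * x^T H x with H symmetric and entrywise <= 0,
# which is DR-submodular and can be non-monotone.
# Feasible set (hypothetical): {x in [0,1]^n : sum(x) <= budget}, a down-closed convex set.

def value(x, h, H):
    n = len(x)
    lin = sum(h[i] * x[i] for i in range(n))
    quad = 0.5 * sum(H[i][j] * x[i] * x[j] for i in range(n) for j in range(n))
    return lin + quad

def grad(x, h, H):
    # Gradient of the quadratic above, valid for symmetric H.
    n = len(x)
    return [h[i] + sum(H[i][j] * x[j] for j in range(n)) for i in range(n)]

def linear_oracle(w, budget):
    # Maximizes <v, w> over the feasible set for an integer budget:
    # put weight 1 on the (at most) `budget` largest strictly positive entries of w.
    order = sorted(range(len(w)), key=lambda i: -w[i])
    v = [0.0] * len(w)
    for i in order[:budget]:
        if w[i] > 0:
            v[i] = 1.0
    return v

def measured_frank_wolfe(h, H, budget, K=100):
    n = len(h)
    x = [0.0] * n  # start at the origin, which lies in any down-closed set
    for _ in range(K):
        g = grad(x, h, H)
        # "Measured" step: damp both the oracle weights and the move by (1 - x).
        w = [g[i] * (1.0 - x[i]) for i in range(n)]
        v = linear_oracle(w, budget)
        x = [x[i] + v[i] * (1.0 - x[i]) / K for i in range(n)]
    return x

# Toy instance: F(x) = x1 + x2 + x3 - (x1*x2 + x1*x3 + x2*x3).
h = [1.0, 1.0, 1.0]
H = [[0.0, -1.0, -1.0], [-1.0, 0.0, -1.0], [-1.0, -1.0, 0.0]]
x = measured_frank_wolfe(h, H, budget=2, K=100)
print(x, value(x, h, H))
```

Because every step adds at most `v / K` coordinatewise, the final iterate is dominated by an average of feasible points and therefore stays inside the down-closed set with no projection or rounding step. An online variant would replace the exact gradient with stochastic estimates and the exact oracle with online linear optimizers, which this sketch does not attempt.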

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques