Online Nonsubmodular Minimization with Delayed Costs: From Full   Information to Bandit Feedback

Tianyi Lin; Aldo Pacchiano; Yaodong Yu; Michael I. Jordan

arXiv:2205.07217·cs.LG·June 2, 2022

Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback

Tianyi Lin, Aldo Pacchiano, Yaodong Yu, Michael I. Jordan

PDF

Open Access

TL;DR

This paper develops algorithms with regret guarantees for online nonsubmodular minimization under delayed feedback, extending to bandit settings and unbounded delays, with applications in sparse estimation and Bayesian optimization.

Contribution

It introduces regret bounds for online nonsubmodular minimization with delays, extending convex relaxation techniques to this setting and analyzing both full information and bandit feedback.

Findings

01

Regret bounds hold even with unbounded delays.

02

Algorithms perform well in simulations for sparse estimation and Bayesian optimization.

03

Extension of convex relaxation analysis to nonsubmodular functions.

Abstract

Motivated by applications to online learning in sparse estimation and Bayesian optimization, we consider the problem of online unconstrained nonsubmodular minimization with delayed costs in both full information and bandit feedback settings. In contrast to previous works on online unconstrained submodular minimization, we focus on a class of nonsubmodular functions with special structure, and prove regret guarantees for several variants of the online and approximate online bandit gradient descent algorithms in static and delayed scenarios. We derive bounds for the agent's regret in the full information and bandit feedback setting, even if the delay between choosing a decision and receiving the incurred cost is unbounded. Key to our approach is the notion of $(α, β)$ -regret and the extension of the generic convex relaxation model from~\citet{El-2020-Optimal}, the analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques