Linear Submodular Maximization with Bandit Feedback
Wenjing Chen, Victoria G. Crawford

TL;DR
This paper introduces algorithms for maximizing linear-structured submodular functions under bandit feedback, achieving near-oracle approximation guarantees and demonstrating significant empirical improvements in sample efficiency.
Contribution
It develops novel approximation algorithms for linear-structured submodular maximization with bandit feedback, leveraging adaptive allocation inspired by linear bandit methods.
Findings
Algorithms achieve approximation guarantees close to oracle access.
Significant empirical improvements in sample efficiency.
Effective in move recommendation applications.
Abstract
Submodular optimization with bandit feedback has recently been studied in a variety of contexts. In a number of real-world applications such as diversified recommender systems and data summarization, the submodular function exhibits additional linear structure. We consider developing approximation algorithms for the maximization of a submodular objective function , where . It is assumed that we have value oracle access to the functions , but the coefficients are unknown, and can only be accessed via noisy queries. We develop algorithms for this setting inspired by adaptive allocation algorithms in the best-arm identification for linear bandit, with approximation guarantees arbitrarily close to the setting where we have value oracle access to . Finally, we empirically demonstrate that our algorithms make vast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Quantum Computing Algorithms and Architecture
