No-Regret M${}^{\natural}$-Concave Function Maximization: Stochastic Bandit Algorithms and Hardness of Adversarial Full-Information Setting
Taihei Oki, Shinsaku Sakaue

TL;DR
This paper develops algorithms for maximizing M${}^{ atural}$-concave functions in stochastic bandit settings with sublinear regret, and proves hardness results for adversarial full-information scenarios, highlighting the problem's complexity.
Contribution
It introduces new stochastic bandit algorithms with regret guarantees for M${}^{ atural}$-concave functions and establishes computational hardness in adversarial settings.
Findings
Achieved $O(T^{-1/2})$-simple regret and $O(T^{2/3})$-regret algorithms in stochastic bandit setting.
Proved polynomial-time algorithms cannot attain sublinear regret in adversarial full-information setting.
Demonstrated robustness of greedy algorithms to local errors in M${}^{ atural}$-concave maximization.
Abstract
M-concave functions, a.k.a. gross substitute valuation functions, play a fundamental role in many fields, including discrete mathematics and economics. In practice, perfect knowledge of M-concave functions is often unavailable a priori, and we can optimize them only interactively based on some feedback. Motivated by such situations, we study online M-concave function maximization problems, which are interactive versions of the problem studied by Murota and Shioura (1999). For the stochastic bandit setting, we present -simple regret and -regret algorithms under times access to unbiased noisy value oracles of M-concave functions. A key to proving these results is the robustness of the greedy algorithm to local errors in M-concave function maximization, which is one of our main technical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Advanced Bandit Algorithms Research
