Online Learning for Approximately-Convex Functions with Long-term Adversarial Constraints
Dhruv Sarkar, Samrat Mukhopadhyay, Abhishek Sinha

TL;DR
This paper introduces an online learning algorithm for approximately-convex functions with long-term adversarial constraints, achieving near-optimal regret and resource bounds in both full-information and bandit settings.
Contribution
It develops a novel first-order online algorithm for $ ext{alpha}$-approximately convex functions with long-term constraints, extending to bandit feedback and broad problem classes.
Findings
Achieves $O( oot T)$ $ ext{alpha}$-regret against fixed benchmarks.
Provides resource consumption bounds of $O(B_T ext{log} T + ilde{O}( oot T))$.
Establishes matching lower bounds, confirming tightness of results.
Abstract
We study an online learning problem with long-term budget constraints in the adversarial setting. In this problem, at each round , the learner selects an action from a convex decision set, after which the adversary reveals a cost function and a resource consumption function . The cost and consumption functions are assumed to be -approximately convex - a broad class that generalizes convexity and encompasses many common non-convex optimization problems, including DR-submodular maximization, Online Vertex Cover, and Regularized Phase Retrieval. The goal is to design an online algorithm that minimizes cumulative cost over a horizon of length while approximately satisfying a long-term budget constraint of . We propose an efficient first-order online algorithm that guarantees -regret against the optimal fixed feasible benchmark while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
