Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints
Mehrdad Mahdavi, Rong Jin, Tianbao Yang

TL;DR
This paper introduces a new framework for online convex optimization that relaxes the need for immediate feasibility, focusing on long-term constraint satisfaction, and offers algorithms with sublinear regret and constraint violation bounds.
Contribution
It proposes an efficient online convex optimization framework that ensures long-term constraint satisfaction with improved regret bounds, using novel algorithms based on convex-concave optimization and Mirror Prox.
Findings
Achieves A(\, ext{T}) ext{ regret} and A(T^{3/4}) ext{ constraint violation} with the first algorithm.
Guarantees long-term constraint satisfaction at the expense of higher regret bounds.
Extends to partial feedback scenarios with a bandit feedback algorithm achieving similar bounds.
Abstract
In this paper we propose a framework for solving constrained online convex optimization problem. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set from which the decisions are made. While for simple shapes (e.g. Euclidean ball) the projection is straightforward, for arbitrary complex sets this is the main computational challenge and may be inefficient in practice. In this paper, we consider an alternative online convex optimization problem. Instead of requiring decisions belong to for all rounds, we only require that the constraints which define the set be satisfied in the long run. We show that our framework can be utilized to solve a relaxed version of online learning with side constraints addressed in \cite{DBLP:conf/colt/MannorT06} and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
