A Low Complexity Algorithm with $O(\sqrt{T})$ Regret and $O(1)$ Constraint Violations for Online Convex Optimization with Long Term Constraints
Hao Yu, Michael J. Neely

TL;DR
This paper introduces a simple online convex optimization algorithm that achieves optimal regret of $O(\sqrt{T})$ while maintaining only constant long-term constraint violations, improving efficiency over previous methods.
Contribution
The paper presents a novel low-complexity algorithm that guarantees $O(\sqrt{T})$ regret and $O(1)$ long-term constraint violations, surpassing prior algorithms in performance and simplicity.
Findings
Achieves $O(\sqrt{T})$ regret bound.
Maintains $O(1)$ constraint violations.
Simplifies implementation compared to existing methods.
Abstract
This paper considers online convex optimization over a complicated constraint set, which typically consists of multiple functional constraints and a set constraint. The conventional online projection algorithm (Zinkevich, 2003) can be difficult to implement due to the potentially high computation complexity of the projection operation. In this paper, we relax the functional constraints by allowing them to be violated at each round but still requiring them to be satisfied in the long term. This type of relaxed online convex optimization (with long term constraints) was first considered in Mahdavi et al. (2012). That prior work proposes an algorithm to achieve regret and constraint violations for general problems and another algorithm to achieve an bound for both regret and constraint violations when the constraint set can be described by a finite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
