Tight Bounds for Online Convex Optimization with Adversarial Constraints
Abhishek Sinha, Rahul Vaze

TL;DR
This paper proves that in constrained online convex optimization, it is possible to achieve both sublinear regret and constraint violation simultaneously, resolving a long-standing open problem.
Contribution
The paper introduces a novel online policy that attains $O( oot{T} ext{)}$ regret and constraint violation without restrictive assumptions, combining AdaGrad and Lyapunov optimization.
Findings
Achieves $O( oot{T} ext{)}$ regret and $ ilde{O}( oot{T} ext{)}$ constraint violation.
Provides a short and elegant analysis.
Answers a long-standing open question in constrained online convex optimization.
Abstract
A well-studied generalization of the standard online convex optimization (OCO) is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the learner after the action for that round is chosen. The objective is to design an online policy that simultaneously achieves a small regret while ensuring small cumulative constraint violation (CCV) against an adaptive adversary. A long-standing open question in COCO is whether an online policy can simultaneously achieve regret and CCV without any restrictive assumptions. For the first time, we answer this in the affirmative and show that an online policy can simultaneously achieve regret and CCV. We establish this result by effectively combining the adaptive regret bound of the AdaGrad algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Security in Wireless Sensor Networks
MethodsAdaGrad
