Tight Bounds for Online Convex Optimization with Adversarial Constraints

Abhishek Sinha; Rahul Vaze

arXiv:2405.09296·cs.LG·May 16, 2024

Tight Bounds for Online Convex Optimization with Adversarial Constraints

Abhishek Sinha, Rahul Vaze

PDF

Open Access

TL;DR

This paper proves that in constrained online convex optimization, it is possible to achieve both sublinear regret and constraint violation simultaneously, resolving a long-standing open problem.

Contribution

The paper introduces a novel online policy that attains $O( oot{T} ext{)}$ regret and constraint violation without restrictive assumptions, combining AdaGrad and Lyapunov optimization.

Findings

01

Achieves $O( oot{T} ext{)}$ regret and $ ilde{O}( oot{T} ext{)}$ constraint violation.

02

Provides a short and elegant analysis.

03

Answers a long-standing open question in constrained online convex optimization.

Abstract

A well-studied generalization of the standard online convex optimization (OCO) is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the learner after the action for that round is chosen. The objective is to design an online policy that simultaneously achieves a small regret while ensuring small cumulative constraint violation (CCV) against an adaptive adversary. A long-standing open question in COCO is whether an online policy can simultaneously achieve $O (T)$ regret and $O (T)$ CCV without any restrictive assumptions. For the first time, we answer this in the affirmative and show that an online policy can simultaneously achieve $O (T)$ regret and $\tilde{O} (T)$ CCV. We establish this result by effectively combining the adaptive regret bound of the AdaGrad algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Security in Wireless Sensor Networks

MethodsAdaGrad