$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation   for Constrained Online Convex Optimization

Rahul Vaze; Abhishek Sinha

arXiv:2502.05019·cs.LG·February 11, 2025

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

Rahul Vaze, Abhishek Sinha

PDF

Open Access 1 Video

TL;DR

This paper introduces an online convex optimization algorithm that achieves an $O(\sqrt{T})$ static regret and an instance-dependent constraint violation bound, improving adaptability to specific problem structures.

Contribution

The paper presents a new algorithm for constrained online convex optimization with static regret of $O(\sqrt{T})$ and an instance-dependent constraint violation bound, leveraging geometric properties of constraints.

Findings

01

Achieves $O(\sqrt{T})$ static regret.

02

Provides an instance-dependent bound on cumulative constraint violation.

03

Outperforms previous universal bounds by exploiting geometric properties.

Abstract

The constrained version of the standard online convex optimization (OCO) framework, called COCO is considered, where on every round, a convex cost function and a convex constraint function are revealed to the learner after it chooses the action for that round. The objective is to simultaneously minimize the static regret and cumulative constraint violation (CCV). An algorithm is proposed that guarantees a static regret of $O (T)$ and a CCV of $min {\cV, O (T lo g T)}$ , where $\cV$ depends on the distance between the consecutively revealed constraint sets, the shape of constraint sets, dimension of action space and the diameter of the action space. For special cases of constraint sets, $\cV = O (1)$ . Compared to the state of the art results, static regret of $O (T)$ and CCV of $O (T lo g T)$ , that were universal, the new result on CCV is instance dependent, which is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems