$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization
Rahul Vaze, Abhishek Sinha

TL;DR
This paper introduces an online convex optimization algorithm that achieves an $O(\sqrt{T})$ static regret and an instance-dependent constraint violation bound, improving adaptability to specific problem structures.
Contribution
The paper presents a new algorithm for constrained online convex optimization with static regret of $O(\sqrt{T})$ and an instance-dependent constraint violation bound, leveraging geometric properties of constraints.
Findings
Achieves $O(\sqrt{T})$ static regret.
Provides an instance-dependent bound on cumulative constraint violation.
Outperforms previous universal bounds by exploiting geometric properties.
Abstract
The constrained version of the standard online convex optimization (OCO) framework, called COCO is considered, where on every round, a convex cost function and a convex constraint function are revealed to the learner after it chooses the action for that round. The objective is to simultaneously minimize the static regret and cumulative constraint violation (CCV). An algorithm is proposed that guarantees a static regret of and a CCV of , where depends on the distance between the consecutively revealed constraint sets, the shape of constraint sets, dimension of action space and the diameter of the action space. For special cases of constraint sets, . Compared to the state of the art results, static regret of and CCV of , that were universal, the new result on CCV is instance dependent, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems
