On Dynamic Regret and Constraint Violations in Constrained Online Convex Optimization
Rahul Vaze

TL;DR
This paper introduces an algorithm for constrained online convex optimization that effectively balances minimizing dynamic regret and constraint violations, with bounds tied to the path-length of optimal actions.
Contribution
The paper presents a novel projected gradient descent algorithm with optimal bounds for dynamic regret and constraint violations in constrained OCO.
Findings
Both dynamic regret and constraint violation are bounded by the path-length.
The bounds achieved are proven to be optimal.
The algorithm operates with only one gradient evaluation per iteration.
Abstract
A constrained version of the online convex optimization (OCO) problem is considered. With slotted time, for each slot, first an action is chosen. Subsequently the loss function and the constraint violation penalty evaluated at the chosen action point is revealed. For each slot, both the loss function as well as the function defining the constraint set is assumed to be smooth and strongly convex. In addition, once an action is chosen, local information about a feasible set within a small neighborhood of the current action is also revealed. An algorithm is allowed to compute at most one gradient at its point of choice given the described feedback to choose the next action. The goal of an algorithm is to simultaneously minimize the dynamic regret (loss incurred compared to the oracle's loss) and the constraint violation penalty (penalty accrued compared to the oracle's penalty). We propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Optimization and Search Problems
