An Optimistic Algorithm for Online Convex Optimization with Adversarial   Constraints

Jordan Lekeufack; Michael I. Jordan

arXiv:2412.08060·stat.ML·March 14, 2025

An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints

Jordan Lekeufack, Michael I. Jordan

PDF

Open Access

TL;DR

This paper introduces an optimistic algorithm for online convex optimization with adversarial constraints, leveraging predictions to improve regret and constraint violation bounds, especially when predictions are accurate.

Contribution

The paper proposes a novel optimistic algorithm that improves bounds on regret and constraint violations in online convex optimization with adversarial constraints by exploiting prediction accuracy.

Findings

01

Improved bounds of $O(\sqrt{E_T(f)})$ for regret and $O(\sqrt{E_T(g^+)})$ for constraint violations.

02

Achieves better performance with accurate predictions, reducing regret and violations.

03

Extends to adversarial contextual bandits with risk constraints, providing optimistic bounds.

Abstract

We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the algorithm has access to predictions of the loss and constraint functions. Our results show that we can improve the current best bounds of $O (T)$ regret and $\tilde{O} (T)$ cumulative constraint violations to $O (E_{T} (f))$ and $\tilde{O} (E_{T} (g^{+}))$ , respectively, where $E_{T} (f)$ and $E_{T} (g^{+})$ represent the cumulative prediction errors of the loss and constraint functions. In the worst case, where $E_{T} (f) = O (T)$ and $E_{T} (g^{+}) = O (T)$ (assuming bounded gradients of the loss and constraint functions), our rates match the prior $O (T)$ results. However, when the loss and constraint predictions are accurate, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Security in Wireless Sensor Networks · Robotic Path Planning Algorithms

MethodsFocus