Trading Regret for Efficiency: Online Convex Optimization with Long Term   Constraints

Mehrdad Mahdavi; Rong Jin; Tianbao Yang

arXiv:1111.6082·cs.LG·October 1, 2012·167 cites

Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints

Mehrdad Mahdavi, Rong Jin, Tianbao Yang

PDF

Open Access

TL;DR

This paper introduces a new framework for online convex optimization that relaxes the need for immediate feasibility, focusing on long-term constraint satisfaction, and offers algorithms with sublinear regret and constraint violation bounds.

Contribution

It proposes an efficient online convex optimization framework that ensures long-term constraint satisfaction with improved regret bounds, using novel algorithms based on convex-concave optimization and Mirror Prox.

Findings

01

Achieves A(\, ext{T}) ext{ regret} and A(T^{3/4}) ext{ constraint violation} with the first algorithm.

02

Guarantees long-term constraint satisfaction at the expense of higher regret bounds.

03

Extends to partial feedback scenarios with a bandit feedback algorithm achieving similar bounds.

Abstract

In this paper we propose a framework for solving constrained online convex optimization problem. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set $K$ from which the decisions are made. While for simple shapes (e.g. Euclidean ball) the projection is straightforward, for arbitrary complex sets this is the main computational challenge and may be inefficient in practice. In this paper, we consider an alternative online convex optimization problem. Instead of requiring decisions belong to $K$ for all rounds, we only require that the constraints which define the set $K$ be satisfied in the long run. We show that our framework can be utilized to solve a relaxed version of online learning with side constraints addressed in \cite{DBLP:conf/colt/MannorT06} and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms