Optimal Regularized Online Allocation by Adaptive Re-Solving

Wanteng Ma; Ying Cao; Danny H.K. Tsang; Dong Xia

arXiv:2209.00399·cs.LG·July 18, 2023·1 cites

Optimal Regularized Online Allocation by Adaptive Re-Solving

Wanteng Ma, Ying Cao, Danny H.K. Tsang, Dong Xia

PDF

Open Access

TL;DR

This paper presents an adaptive dual-based algorithm framework for regularized online resource allocation that achieves optimal regret bounds and reduces computational costs through infrequent re-solving, applicable to non-concave rewards and complex constraints.

Contribution

It introduces a novel adaptive dual-based framework with approximate solutions, eliminating the log-log regret factor and enabling fast, flexible algorithms for complex online allocation problems.

Findings

01

Achieves optimal logarithmic regret under certain conditions.

02

Eliminates the log-log factor in regret bounds through dual analysis.

03

Reduces computational complexity with infrequent re-solving scheme.

Abstract

This paper introduces a dual-based algorithm framework for solving the regularized online resource allocation problems, which have potentially non-concave cumulative rewards, hard resource constraints, and a non-separable regularizer. Under a strategy of adaptively updating the resource constraints, the proposed framework only requests approximate solutions to the empirical dual problems up to a certain accuracy and yet delivers an optimal logarithmic regret under a locally second-order growth condition. Surprisingly, a delicate analysis of the dual objective function enables us to eliminate the notorious log-log factor in regret bound. The flexible framework renders renowned and computationally fast algorithms immediately applicable, e.g., dual stochastic gradient descent. Additionally, an infrequent re-solving scheme is proposed, which significantly reduces computational demands…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques