Constrained Online Two-stage Stochastic Optimization: Near Optimal   Algorithms via Adversarial Learning

Jiashuo Jiang

arXiv:2302.00997·cs.LG·May 21, 2024

Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning

Jiashuo Jiang

PDF

Open Access

TL;DR

This paper introduces near-optimal online algorithms for two-stage stochastic optimization with long-term constraints, leveraging adversarial learning to achieve improved regret bounds under various stochastic and adversarial settings.

Contribution

The paper develops a unified adversarial learning-based framework for online two-stage stochastic optimization, achieving state-of-the-art regret bounds and robustness to adversarial corruptions.

Findings

01

Achieves $O( oot T)$ regret in stochastic settings.

02

Provides robustness to adversarial model parameter corruptions.

03

Develops algorithms with regret depending on prediction accuracy in non-stationary cases.

Abstract

We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of $T$ periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage action from a feasible set that depends both on the first-stage decision and the model parameter. We aim to minimize the cumulative objective value while guaranteeing that the long-term average second-stage decision belongs to a set. We develop online algorithms for the online two-stage problem from adversarial learning algorithms. Also, the regret bound of our algorithm cam be reduced to the regret bound of embedded adversarial learning algorithms. Based on our framework, we obtain new results under various settings. When the model parameter at each period is drawn from identical distributions, we derive \textit{state-of-art} $O (T)$ regret that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Auction Theory and Applications

MethodsClass-activation map