A Probabilistic Sample Path Convergence Time Analysis of Drift-Plus-Penalty Algorithm for Stochastic Optimization
Xiaohan Wei, Hao Yu, Michael J. Neely

TL;DR
This paper analyzes the convergence time of the drift-plus-penalty algorithm in stochastic optimization, providing probabilistic bounds and improvements for single-constraint cases.
Contribution
It offers a probabilistic sample path convergence time analysis for the drift-plus-penalty algorithm, including tighter bounds for single-constraint scenarios.
Findings
Convergence time is $ ext{O}(rac{1}{ ext{ε}^2})$ with high probability.
Sample path approximation to optimality is achieved within $ ext{O}(rac{1}{ ext{ε}^2} ext{log}^2rac{1}{ ext{ε}})$ time.
Improved convergence bounds are provided for single-constraint problems.
Abstract
This paper considers the problem of minimizing the time average of a controlled stochastic process subject to multiple time average constraints on other related processes. The probability distribution of the random events in the system is unknown to the controller. A typical application is time average power minimization subject to network throughput constraints for different users in a network with time varying channel conditions. We show that with probability at least , the classical drift-plus-penalty algorithm provides a sample path approximation to optimality with a convergence time , where is a parameter related to the algorithm. When there is only one constraint, we further show that the convergence time can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Advanced Manufacturing and Logistics Optimization · Simulation Techniques and Applications
