A PTAS for a Class of Stochastic Dynamic Programs
Hao Fu, Jian Li, Pan Xu

TL;DR
This paper introduces a polynomial time approximation scheme (PTAS) for a class of stochastic dynamic programs, including a new PTAS for a stochastic maximization problem involving adaptive probing of items with known distributions.
Contribution
The paper develops a general framework for PTAS in stochastic dynamic programs and provides the first PTAS for a stochastic combinatorial optimization problem involving adaptive probing.
Findings
First PTAS for the stochastic max problem with adaptive probing
Achieved approximation ratios better than the previous 1-1/e bound
Extended framework to other variants and related problems
Abstract
We develop a framework for obtaining polynomial time approximation schemes (PTAS) for a class of stochastic dynamic programs. Using our framework, we obtain the first PTAS for the following stochastic combinatorial optimization problems: \probemax: We are given a set of items, each item has a value which is an independent random variable with a known (discrete) distribution . We can {\em probe} a subset of items sequentially. Each time after {probing} an item , we observe its value realization, which follows the distribution . We can {\em adaptively} probe at most items and each item can be probed at most once. The reward is the maximum among the realized values. Our goal is to design an adaptive probing policy such that the expected value of the reward is maximized. To the best of our knowledge, the best known approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
