Percentile Policies for Tracking of Markovian Random Processes with Asymmetric Cost and Observation
Parisa Mansourifard, Tara Javidi, Bhaskar Krishnamachari

TL;DR
This paper introduces a computationally feasible class of percentile-based policies for tracking Markovian processes with asymmetric costs and observations, outperforming simpler policies and approaching optimal performance.
Contribution
It proposes a novel percentile policy framework for partially observable Markov decision processes with asymmetric costs, providing a heuristic that is both practical and near-optimal.
Findings
Percentile policies outperform myopic policies in simulations.
The heuristic policy performs close to the optimal under certain conditions.
A low-complexity lower bound on the optimal cost is derived.
Abstract
Motivated by wide-ranging applications such as video delivery over networks using Multiple Description Codes, congestion control, and inventory management, we study the state-tracking of a Markovian random process with a known transition matrix and a finite ordered state set. The decision-maker must select a state as an action at each time step to minimize the total expected cost. The decision-maker is faced with asymmetries both in cost and observation: in case the selected state is less than the actual state of the Markovian process, an under-utilization cost occurs and only partial observation about the actual state is revealed; otherwise, the decision incurs an over-utilization cost and reveals full information about the actual state. We can formulate this problem as a Partially Observable Markov Decision Process which can be expressed as a dynamic program based on the last full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Green IT and Sustainability · Caching and Content Delivery
