Bayesian Sequential Detection with Phase-Distributed Change Time and   Nonlinear Penalty -- A POMDP Approach

Vikram Krishnamurthy

arXiv:1011.5298·cs.IT·March 17, 2015·2 cites

Bayesian Sequential Detection with Phase-Distributed Change Time and Nonlinear Penalty -- A POMDP Approach

Vikram Krishnamurthy

PDF

Open Access

TL;DR

This paper characterizes the structure of optimal policies in Bayesian sequential detection problems with phase-type change times, using a POMDP framework, and demonstrates their applicability across various decision scenarios.

Contribution

It introduces a threshold switching curve structure for optimal policies in Bayesian detection problems and develops a stochastic gradient method to estimate these thresholds.

Findings

01

Optimal policies have a threshold switching curve structure.

02

The threshold structure applies to multiple Bayesian decision problems.

03

A stochastic gradient algorithm effectively estimates the threshold curve.

Abstract

We show that the optimal decision policy for several types of Bayesian sequential detection problems has a threshold switching curve structure on the space of posterior distributions. This is established by using lattice programming and stochastic orders in a partially observed Markov decision process (POMDP) framework. A stochastic gradient algorithm is presented to estimate the optimal linear approximation to this threshold curve. We illustrate these results by first considering quickest time detection with phase-type distributed change time and a variance stopping penalty. Then it is proved that the threshold switching curve also arises in several other Bayesian decision problems such as quickest transient detection, exponential delay (risk-sensitive) penalties, stopping time problems in social learning, and multi-agent scheduling in a changing world. Using Blackwell dominance, it is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Bayesian Modeling and Causal Inference · Auction Theory and Applications