Cost-Bounded Active Classification Using Partially Observable Markov   Decision Processes

Bo Wu; Mohamadreza Ahmadi; Suda Bharadwaj; and Ufuk Topcu

arXiv:1810.00097·cs.SY·October 2, 2018

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Bo Wu, Mohamadreza Ahmadi, Suda Bharadwaj, and Ufuk Topcu

PDF

Open Access

TL;DR

This paper develops a decision-theoretic framework using POMDPs for active classification of dynamical systems modeled as MDPs, aiming for efficient and confident model identification within cost and time constraints.

Contribution

It introduces a novel POMDP-based approach for active classification of MDP models, including exact and approximate strategies for decision-making under cost and confidence constraints.

Findings

01

Exact strategy computed via value iteration

02

Approximate strategy using adaptive sampling

03

Successful application in medical diagnosis and intrusion detection

Abstract

Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this work, we study the problem of actively classifying dynamical systems with a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the dynamical system, and observe its reactions so that the true model is determined efficiently with high confidence. To this end, we present a decision-theoretic framework based on partially observable Markov decision processes (POMDPs). The proposed framework relies on assigning a classification belief (a probability distribution) to each candidate MDP model. Given an initial belief, some misclassification probabilities, a cost bound, and a finite time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Water Systems and Optimization