Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes
Bo Wu, Mohamadreza Ahmadi, Suda Bharadwaj, and Ufuk Topcu

TL;DR
This paper develops a decision-theoretic framework using POMDPs for active classification of dynamical systems modeled as MDPs, aiming for efficient and confident model identification within cost and time constraints.
Contribution
It introduces a novel POMDP-based approach for active classification of MDP models, including exact and approximate strategies for decision-making under cost and confidence constraints.
Findings
Exact strategy computed via value iteration
Approximate strategy using adaptive sampling
Successful application in medical diagnosis and intrusion detection
Abstract
Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this work, we study the problem of actively classifying dynamical systems with a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the dynamical system, and observe its reactions so that the true model is determined efficiently with high confidence. To this end, we present a decision-theoretic framework based on partially observable Markov decision processes (POMDPs). The proposed framework relies on assigning a classification belief (a probability distribution) to each candidate MDP model. Given an initial belief, some misclassification probabilities, a cost bound, and a finite time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Water Systems and Optimization
