Constrained Active Classification Using Partially Observable Markov Decision Processes
Bo Wu, Niklas Lauffer, Mohamadreza Ahmadi, Suda Bharadwaj, Zhe Xu,, Ufuk Topcu

TL;DR
This paper develops a decision-theoretic framework using POMDPs for active classification of dynamical systems, enabling efficient and confident attribute identification through interaction and observation.
Contribution
It introduces a POMDP-based approach with three algorithms for active classification, including exact and approximate methods, applicable to various real-world scenarios.
Findings
Exact strategy computed via value iteration
Approximate strategies using adaptive sampling and Monte Carlo tree search
Framework demonstrated on medical, security, and wildlife classification examples
Abstract
In this work, we study the problem of actively classifying the attributes of dynamical systems characterized as a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the dynamical system and observe its reactions so that the attribute of interest is classified efficiently with high confidence. We present a decision-theoretic framework based on partially observable Markov decision processes (POMDPs). The proposed framework relies on assigning a classification belief (a probability distribution) to the attributes of interest. Given an initial belief, a confidence level over which a classification decision can be made, a cost bound, safe belief sets, and a finite time horizon, we compute POMDP strategies leading to classification decisions. We present three different algorithms to compute such strategies. The first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Anomaly Detection Techniques and Applications
