Active sequential hypothesis testing
Mohammad Naghshvar, Tara Javidi

TL;DR
This paper develops a theoretical framework for active sequential hypothesis testing, establishing fundamental limits and proposing heuristics that achieve asymptotic optimality in information acquisition and error rates.
Contribution
It introduces new bounds on the optimal cost and analyzes heuristic policies for adaptive action selection, demonstrating asymptotic optimality and maximum information rates.
Findings
Lower bounds on the total cost and information acquisition rate are established.
A heuristic policy is shown to be asymptotically optimal as the penalty increases.
In noisy dynamic search, the second heuristic achieves maximum information rate and error exponent.
Abstract
Consider a decision maker who is responsible to dynamically collect observations so as to enhance his information about an underlying phenomena of interest in a speedy manner while accounting for the penalty of wrong declaration. Due to the sequential nature of the problem, the decision maker relies on his current information state to adaptively select the most ``informative'' sensing action among the available ones. In this paper, using results in dynamic programming, lower bounds for the optimal total cost are established. The lower bounds characterize the fundamental limits on the maximum achievable information acquisition rate and the optimal reliability. Moreover, upper bounds are obtained via an analysis of two heuristic policies for dynamic selection of actions. It is shown that the first proposed heuristic achieves asymptotic optimality, where the notion of asymptotic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
