Hidden Markov Models with Multiple Observation Processes
James Y. Zhao

TL;DR
This paper analyzes a hidden Markov model with multiple observation processes, showing that the optimal policy for minimizing the information state entropy is a threshold policy, and providing a formula for the limiting entropy.
Contribution
It proves convergence of the information state in a special case and derives a precise formula for the limiting entropy, facilitating optimal policy computation.
Findings
Optimal policy is always a threshold policy.
The limiting entropy can be calculated with high precision.
The greedy policy is nearly optimal.
Abstract
We consider a hidden Markov model with multiple observation processes, one of which is chosen at each point in time by a policy---a deterministic function of the information state---and attempt to determine which policy minimises the limiting expected entropy of the information state. Focusing on a special case, we prove analytically that the information state always converges in distribution, and derive a formula for the limiting entropy which can be used for calculations with high precision. Using this fomula, we find computationally that the optimal policy is always a threshold policy, allowing it to be easily found. We also find that the greedy policy is almost optimal.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Gaussian Processes and Bayesian Inference · Neural Networks and Applications
