A Dynamic Programming Algorithm for Finding an Optimal Sequence of Informative Measurements
Peter N. Loxley, Ka-Wai Cheung

TL;DR
This paper introduces a dynamic programming algorithm that plans optimal sequences of informative measurements for autonomous agents, improving efficiency over greedy methods in tasks like global search and active sensing.
Contribution
It presents a general-purpose, first-principles dynamic programming approach for planning informative measurement sequences applicable to various states, controls, and dynamics.
Findings
Reduces measurement count by approximately 50% in global search tasks
Enables real-time planning using approximate dynamic programming techniques
Outperforms greedy approaches with non-myopic measurement sequences
Abstract
An informative measurement is the most efficient way to gain information about an unknown state. We present a first-principles derivation of a general-purpose dynamic programming algorithm that returns an optimal sequence of informative measurements by sequentially maximizing the entropy of possible measurement outcomes. This algorithm can be used by an autonomous agent or robot to decide where best to measure next, planning a path corresponding to an optimal sequence of informative measurements. The algorithm is applicable to states and controls that are either continuous or discrete, and agent dynamics that is either stochastic or deterministic; including Markov decision processes and Gaussian processes. Recent results from the fields of approximate dynamic programming and reinforcement learning, including on-line approximations such as rollout and Monte Carlo tree search, allow the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Research in Systems and Signal Processing
