A Dynamic Programming Algorithm for Finding an Optimal Sequence of   Informative Measurements

Peter N. Loxley; Ka-Wai Cheung

arXiv:2109.11808·cs.LG·February 1, 2023

A Dynamic Programming Algorithm for Finding an Optimal Sequence of Informative Measurements

Peter N. Loxley, Ka-Wai Cheung

PDF

Open Access

TL;DR

This paper introduces a dynamic programming algorithm that plans optimal sequences of informative measurements for autonomous agents, improving efficiency over greedy methods in tasks like global search and active sensing.

Contribution

It presents a general-purpose, first-principles dynamic programming approach for planning informative measurement sequences applicable to various states, controls, and dynamics.

Findings

01

Reduces measurement count by approximately 50% in global search tasks

02

Enables real-time planning using approximate dynamic programming techniques

03

Outperforms greedy approaches with non-myopic measurement sequences

Abstract

An informative measurement is the most efficient way to gain information about an unknown state. We present a first-principles derivation of a general-purpose dynamic programming algorithm that returns an optimal sequence of informative measurements by sequentially maximizing the entropy of possible measurement outcomes. This algorithm can be used by an autonomous agent or robot to decide where best to measure next, planning a path corresponding to an optimal sequence of informative measurements. The algorithm is applicable to states and controls that are either continuous or discrete, and agent dynamics that is either stochastic or deterministic; including Markov decision processes and Gaussian processes. Recent results from the fields of approximate dynamic programming and reinforcement learning, including on-line approximations such as rollout and Monte Carlo tree search, allow the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Research in Systems and Signal Processing