Adaptive Sampling using POMDPs with Domain-Specific Considerations

Gautam Salhotra; Christopher E. Denniston; David A. Caron; Gaurav S.; Sukhatme

arXiv:2109.11595·cs.RO·September 27, 2021

Adaptive Sampling using POMDPs with Domain-Specific Considerations

Gautam Salhotra, Christopher E. Denniston, David A. Caron, Gaurav S., Sukhatme

PDF

Open Access 1 Repo

TL;DR

This paper enhances POMDP-based adaptive sampling by optimizing rollout allocation, exploration strategies, and plan commitment, resulting in improved sampling efficiency demonstrated on simulated and real underwater robot data.

Contribution

It introduces novel improvements to POMDP solvers for adaptive sampling, including dynamic rollout allocation, informed exploration, and efficient plan execution, which are validated experimentally.

Findings

01

Adaptive sampling performance improved with proposed methods

02

Dynamic rollout allocation outperforms fixed strategies

03

Fewer rollouts needed for effective sampling

Abstract

We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uscresl/adaptivesamplingpomcp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robotic Path Planning Algorithms