Adaptive Sampling using POMDPs with Domain-Specific Considerations
Gautam Salhotra, Christopher E. Denniston, David A. Caron, Gaurav S., Sukhatme

TL;DR
This paper enhances POMDP-based adaptive sampling by optimizing rollout allocation, exploration strategies, and plan commitment, resulting in improved sampling efficiency demonstrated on simulated and real underwater robot data.
Contribution
It introduces novel improvements to POMDP solvers for adaptive sampling, including dynamic rollout allocation, informed exploration, and efficient plan execution, which are validated experimentally.
Findings
Adaptive sampling performance improved with proposed methods
Dynamic rollout allocation outperforms fixed strategies
Fewer rollouts needed for effective sampling
Abstract
We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robotic Path Planning Algorithms
