No-Regret Replanning under Uncertainty

Wen Sun; Niteesh Sood; Debadeepta Dey; Gireeja Ranade; Siddharth; Prakash; Ashish Kapoor

arXiv:1609.05162·cs.RO·September 19, 2016

No-Regret Replanning under Uncertainty

Wen Sun, Niteesh Sood, Debadeepta Dey, Gireeja Ranade, Siddharth, Prakash, Ashish Kapoor

PDF

Open Access

TL;DR

This paper introduces a no-regret online path planning algorithm for environments with uncertain latent information, modeled via Gaussian Processes, balancing exploration and exploitation effectively.

Contribution

It adapts UCB bandit algorithms to robotic path planning under uncertainty, providing a theoretically grounded approach with proven no-regret guarantees.

Findings

01

Effective in aircraft flight path planning with partial wind observations

02

Balances exploration and exploitation near-optimally

03

Demonstrates theoretical no-regret properties

Abstract

This paper explores the problem of path planning under uncertainty. Specifically, we consider online receding horizon based planners that need to operate in a latent environment where the latent information can be modeled via Gaussian Processes. Online path planning in latent environments is challenging since the robot needs to explore the environment to get a more accurate model of latent information for better planning later and also achieves the task as quick as possible. We propose UCB style algorithms that are popular in the bandit settings and show how those analyses can be adapted to the online robotic path planning problems. The proposed algorithm trades-off exploration and exploitation in near-optimal manner and has appealing no-regret properties. We demonstrate the efficacy of the framework on the application of aircraft flight path planning when the winds are partially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms