Deep R-Learning for Continual Area Sweeping

Rishi Shah; Yuqian Jiang; Justin Hart; Peter Stone

arXiv:2006.00589·cs.LG·June 2, 2020

Deep R-Learning for Continual Area Sweeping

Rishi Shah, Yuqian Jiang, Justin Hart, Peter Stone

PDF

TL;DR

This paper introduces a reinforcement learning approach for continual area sweeping in robotics, enabling robots to adaptively learn to maximize event detection rates in unknown environments, surpassing previous greedy methods.

Contribution

It generalizes the non-uniform coverage problem to less constrained environments and applies RL in a Semi-Markov Decision Process framework for improved performance.

Findings

01

Significant performance improvements over greedy approaches.

02

Effective in both abstract and high-fidelity simulations.

03

Applicable to service robotics scenarios.

Abstract

Coverage path planning is a well-studied problem in robotics in which a robot must plan a path that passes through every point in a given area repeatedly, usually with a uniform frequency. To address the scenario in which some points need to be visited more frequently than others, this problem has been extended to non-uniform coverage planning. This paper considers the variant of non-uniform coverage in which the robot does not know the distribution of relevant events beforehand and must nevertheless learn to maximize the rate of detecting events of interest. This continual area sweeping problem has been previously formalized in a way that makes strong assumptions about the environment, and to date only a greedy approach has been proposed. We generalize the continual area sweeping formulation to include fewer environmental constraints, and propose a novel approach based on reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.