Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning
Arvi Jonnarth, Ola Johansson, Jie Zhao, Michael Felsberg

TL;DR
This paper explores using reinforcement learning with a novel map representation and reward function for online coverage path planning, demonstrating successful sim-to-real transfer and superior performance over existing methods.
Contribution
It introduces a new RL-based approach with a frontier-based map and total variation reward, along with a semi-virtual training environment for effective sim-to-real transfer in CPP.
Findings
Outperforms previous RL and classical methods in simulation
Successfully transfers to real robot deployment
Effective environment randomization enhances robustness
Abstract
Coverage path planning (CPP) is the problem of finding a path that covers the entire free space of a confined area, with applications ranging from robotic lawn mowing to search-and-rescue. While for known environments, offline methods can find provably complete paths, and in some cases optimal solutions, unknown environments need to be planned online during mapping. We investigate the suitability of continuous-space reinforcement learning (RL) for this challenging problem, and propose a computationally feasible egocentric map representation based on frontiers, as well as a novel reward term based on total variation to promote complete coverage. Compared to existing classical methods, this approach allows for a flexible path space, and enables the agent to adapt to specific environment characteristics. Meanwhile, the deployment of RL models on real robot systems is difficult. Training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Control and Dynamics of Mobile Robots
