Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning

Arvi Jonnarth; Ola Johansson; Jie Zhao; Michael Felsberg

arXiv:2406.04920·cs.RO·August 26, 2025

Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning

Arvi Jonnarth, Ola Johansson, Jie Zhao, Michael Felsberg

PDF

Open Access

TL;DR

This paper explores using reinforcement learning with a novel map representation and reward function for online coverage path planning, demonstrating successful sim-to-real transfer and superior performance over existing methods.

Contribution

It introduces a new RL-based approach with a frontier-based map and total variation reward, along with a semi-virtual training environment for effective sim-to-real transfer in CPP.

Findings

01

Outperforms previous RL and classical methods in simulation

02

Successfully transfers to real robot deployment

03

Effective environment randomization enhances robustness

Abstract

Coverage path planning (CPP) is the problem of finding a path that covers the entire free space of a confined area, with applications ranging from robotic lawn mowing to search-and-rescue. While for known environments, offline methods can find provably complete paths, and in some cases optimal solutions, unknown environments need to be planned online during mapping. We investigate the suitability of continuous-space reinforcement learning (RL) for this challenging problem, and propose a computationally feasible egocentric map representation based on frontiers, as well as a novel reward term based on total variation to promote complete coverage. Compared to existing classical methods, this approach allows for a flexible path space, and enables the agent to adapt to specific environment characteristics. Meanwhile, the deployment of RL models on real robot systems is difficult. Training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Control and Dynamics of Mobile Robots