Path Planning of Cleaning Robot with Reinforcement Learning
Woohyeon Moon, Bumgeun Park, Sarvar Hussain Nengroo, Taeyoung Kim, and, Dongsoo Har

TL;DR
This paper introduces a reinforcement learning-based path planning method for cleaning robots that adapts to various environments using transfer learning, achieving faster training and better performance than traditional approaches.
Contribution
It combines PPO with transfer learning, detection of nearest cleaned tiles, reward shaping, and elite set methods to enable efficient path planning across diverse cleaning environments.
Findings
Improved training performance and convergence speed over standard PPO.
Outperforms conventional methods like random and zigzag in cleaning efficiency.
Effective adaptation to different cleaning environments.
Abstract
Recently, as the demand for cleaning robots has steadily increased, therefore household electricity consumption is also increasing. To solve this electricity consumption issue, the problem of efficient path planning for cleaning robot has become important and many studies have been conducted. However, most of them are about moving along a simple path segment, not about the whole path to clean all places. As the emerging deep learning technique, reinforcement learning (RL) has been adopted for cleaning robot. However, the models for RL operate only in a specific cleaning environment, not the various cleaning environment. The problem is that the models have to retrain whenever the cleaning environment changes. To solve this problem, the proximal policy optimization (PPO) algorithm is combined with an efficient path planning that operates in various cleaning environments, using transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Smart Parking Systems Research · Reinforcement Learning in Robotics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Entropy Regularization · Proximal Policy Optimization
