Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
Victor R. F. Miranda, Armando A. Neto, Gustavo M. Freitas, Leonardo A. Mozelli

TL;DR
This paper introduces a novel reward shaping method combined with the SAC algorithm to improve the generalization of deep reinforcement learning policies for robotic local navigation, reducing local minima and collisions in unknown environments.
Contribution
It proposes a new reward function that uses map information to enhance policy generalization and demonstrates SAC's effectiveness over other algorithms in navigation tasks.
Findings
The reward function improves navigation in untrained scenarios.
SAC outperforms other algorithms in collision avoidance.
Method reduces local minima in navigation policies.
Abstract
In this paper, we study the application of DRL algorithms in the context of local navigation problems, in which a robot moves towards a goal location in unknown and cluttered workspaces equipped only with limited-range exteroceptive sensors, such as LiDAR. Collision avoidance policies based on DRL present some advantages, but they are quite susceptible to local minima, once their capacity to learn suitable actions is limited to the sensor range. Since most robots perform tasks in unstructured environments, it is of great interest to seek generalized local navigation policies capable of avoiding local minima, especially in untrained scenarios. To do so, we propose a novel reward function that incorporates map information gained in the training stage, increasing the agent's capacity to deliberate about the best course of action. Also, we use the SAC algorithm for training our ANN, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
Methods1x1 Convolution · Average Pooling · Global Average Pooling · Dilated Convolution · Convolution · Switchable Atrous Convolution
