Learning Agents With Prioritization and Parameter Noise in Continuous   State and Action Space

Rajesh Mangannavar; Gopalakrishnan Srinivasaraghavan

arXiv:2410.11250·cs.AI·October 16, 2024

Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space

Rajesh Mangannavar, Gopalakrishnan Srinivasaraghavan

PDF

TL;DR

This paper introduces a prioritized reinforcement learning approach combining DQN and DDPG with parameter noise, achieving improved robustness and performance in continuous state and action space problems like robotics and autonomous vehicles.

Contribution

It presents a novel prioritized method integrating DQN and DDPG with parameter noise, enhancing performance and robustness in continuous RL tasks.

Findings

01

Outperforms previous methods in continuous RL tasks

02

Parameter noise improves training robustness

03

Achieves significant performance gains

Abstract

Among the many variants of RL, an important class of problems is where the state and action spaces are continuous -- autonomous robots, autonomous vehicles, optimal control are all examples of such problems that can lend themselves naturally to reinforcement based algorithms, and have continuous state and action spaces. In this paper, we introduce a prioritized form of a combination of state-of-the-art approaches such as Deep Q-learning (DQN) and Deep Deterministic Policy Gradient (DDPG) to outperform the earlier results for continuous state and action space problems. Our experiments also involve the use of parameter noise during training resulting in more robust deep RL models outperforming the earlier results significantly. We believe these results are a valuable addition for continuous state and action space problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning