Hyperparameter Tuning for Deep Reinforcement Learning Applications
Mariam Kiran, Melis Ozyildirim

TL;DR
This paper introduces a distributed genetic algorithm framework to efficiently tune hyperparameters in deep reinforcement learning, enhancing training speed, robustness, and deployment readiness across diverse applications.
Contribution
It presents a novel scalable genetic algorithm approach for hyperparameter tuning in deep RL, outperforming Bayesian methods in efficiency and robustness.
Findings
Fewer training episodes needed with more generations
Improved robustness of RL models for deployment
Scalable tuning across simple to complex RL problems
Abstract
Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to complex data centers. However, setting the right hyperparameters can have a huge impact on the deployed solution performance and reliability in the inference models, produced via RL, used for decision-making. Hyperparameter search itself is a laborious process that requires many iterations and computationally expensive to find the best settings that produce the best neural network architectures. In comparison to other neural network architectures, deep RL has not witnessed much hyperparameter tuning, due to its algorithm complexity and simulation platforms needed. In this paper, we propose a distributed variable-length genetic algorithm framework to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Evolutionary Algorithms and Applications
