Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement   Learning

Th\'eo Vincent; Fabian Wahren; Jan Peters; Boris Belousov; Carlo; D'Eramo

arXiv:2405.16195·cs.LG·March 4, 2025

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning

Th\'eo Vincent, Fabian Wahren, Jan Peters, Boris Belousov, Carlo, D'Eramo

PDF

Open Access 1 Video

TL;DR

AdaQN is a novel AutoRL method that adaptively selects hyperparameters on-the-fly without extra samples, improving efficiency, stability, and performance in deep RL tasks.

Contribution

Introduces AdaQN, a new adaptive hyperparameter selection method for RL that accounts for non-stationarity without additional samples, enhancing AutoRL applicability.

Findings

01

Improves sample-efficiency and training stability in RL tasks.

02

Demonstrates superior performance on MuJoCo and Atari benchmarks.

03

Shows robustness to stochastic environments.

Abstract

Deep Reinforcement Learning (RL) is well known for being highly sensitive to hyperparameters, requiring practitioners substantial efforts to optimize them for the problem at hand. This also limits the applicability of RL in real-world scenarios. In recent years, the field of automated Reinforcement Learning (AutoRL) has grown in popularity by trying to address this issue. However, these approaches typically hinge on additional samples to select well-performing hyperparameters, hindering sample-efficiency and practicality. Furthermore, most AutoRL methods are heavily based on already existing AutoML methods, which were originally developed neglecting the additional challenges inherent to RL due to its non-stationarities. In this work, we propose a new approach for AutoRL, called Adaptive $Q$ -Network (AdaQN), that is tailored to RL to take into account the non-stationarity of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning· slideslive

Taxonomy

TopicsNeural Networks and Applications