rQdia: Regularizing Q-Value Distributions With Image Augmentation

Sam Lerman; Jing Bi

arXiv:2506.21367·cs.LG·June 27, 2025

rQdia: Regularizing Q-Value Distributions With Image Augmentation

Sam Lerman, Jing Bi

PDF

Open Access

TL;DR

rQdia introduces a regularization method using image augmentation in pixel-based deep reinforcement learning, improving performance and sample efficiency across multiple benchmarks and tasks.

Contribution

It presents a novel auxiliary loss that regularizes Q-value distributions via image augmentation, enhancing existing algorithms like DrQ, SAC, and Data-Efficient Rainbow.

Findings

01

Boosts DrQ and SAC performance on most MuJoCo tasks.

02

Improves Data-Efficient Rainbow on many Atari environments.

03

Enables model-free continuous control directly from pixel inputs.

Abstract

rQdia regularizes Q-value distributions with augmented images in pixel-based deep reinforcement learning. With a simple auxiliary loss, that equalizes these distributions via MSE, rQdia boosts DrQ and SAC on 9/12 and 10/12 tasks respectively in the MuJoCo Continuous Control Suite from pixels, and Data-Efficient Rainbow on 18/26 Atari Arcade environments. Gains are measured in both sample efficiency and longer-term training. Moreover, the addition of rQdia finally propels model-free continuous control from pixels over the state encoding baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Medical Image Segmentation Techniques · AI in cancer detection