Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies

Yi Ma; Hongyao Tang; Chenjun Xiao; Yaodong Yang; Wei Wei; Jianye Hao; Jiye Liang

arXiv:2508.03194·cs.LG·August 6, 2025

Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies

Yi Ma, Hongyao Tang, Chenjun Xiao, Yaodong Yang, Wei Wei, Jianye Hao, Jiye Liang

PDF

TL;DR

This survey reviews how scaling data, networks, and training budgets can enhance deep reinforcement learning performance, highlighting strategies, challenges, and future directions for decision-making applications.

Contribution

It systematically analyzes scaling strategies in DRL across data, network, and training budgets, providing a comprehensive roadmap for future research and practical implementation.

Findings

01

Data scaling improves learning efficiency through parallel sampling.

02

Network scaling enhances model expressivity with architectural innovations.

03

Training budget scaling accelerates convergence via distributed training and large batch sizes.

Abstract

In recent years, the expansion of neural network models and training data has driven remarkable progress in deep learning, particularly in computer vision and natural language processing. This advancement is underpinned by the concept of Scaling Laws, which demonstrates that scaling model parameters and training data enhances learning performance. While these fields have witnessed breakthroughs, such as the development of large language models like GPT-4 and advanced vision models like Midjourney, the application of scaling laws in deep reinforcement learning (DRL) remains relatively unexplored. Despite its potential to improve performance, the integration of scaling laws into DRL for decision making has not been fully realized. This review addresses this gap by systematically analyzing scaling strategies in three dimensions: data, network, and training budget. In data scaling, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.