Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Yi Ma, Hongyao Tang, Chenjun Xiao, Yaodong Yang, Wei Wei, Jianye Hao, Jiye Liang

TL;DR
This survey reviews how scaling data, networks, and training budgets can enhance deep reinforcement learning performance, highlighting strategies, challenges, and future directions for decision-making applications.
Contribution
It systematically analyzes scaling strategies in DRL across data, network, and training budgets, providing a comprehensive roadmap for future research and practical implementation.
Findings
Data scaling improves learning efficiency through parallel sampling.
Network scaling enhances model expressivity with architectural innovations.
Training budget scaling accelerates convergence via distributed training and large batch sizes.
Abstract
In recent years, the expansion of neural network models and training data has driven remarkable progress in deep learning, particularly in computer vision and natural language processing. This advancement is underpinned by the concept of Scaling Laws, which demonstrates that scaling model parameters and training data enhances learning performance. While these fields have witnessed breakthroughs, such as the development of large language models like GPT-4 and advanced vision models like Midjourney, the application of scaling laws in deep reinforcement learning (DRL) remains relatively unexplored. Despite its potential to improve performance, the integration of scaling laws into DRL for decision making has not been fully realized. This review addresses this gap by systematically analyzing scaling strategies in three dimensions: data, network, and training budget. In data scaling, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
