Statistical Reinforcement Learning in the Real World: A Survey of Challenges and Future Directions

Asim H. Gazi; Yongyi Guo; Daiqi Gao; Ziping Xu; Kelly W. Zhang; Susan A. Murphy

arXiv:2601.15353·stat.AP·January 23, 2026

Statistical Reinforcement Learning in the Real World: A Survey of Challenges and Future Directions

Asim H. Gazi, Yongyi Guo, Daiqi Gao, Ziping Xu, Kelly W. Zhang, Susan A. Murphy

PDF

Open Access

TL;DR

This survey reviews recent advances in statistical reinforcement learning addressing real-world challenges like limited interactions and environment changes, emphasizing methods for data efficiency, continual improvement, and future research directions.

Contribution

It provides a comprehensive overview of recent statistical RL methods tailored for practical deployment challenges and outlines future research directions for impactful real-world applications.

Findings

01

Methods for maximizing data utility in offline analysis

02

Techniques for enhancing sample efficiency online

03

Strategies for designing deployment sequences for continual improvement

Abstract

Reinforcement learning (RL) has achieved remarkable success in real-world decision-making across diverse domains, including gaming, robotics, online advertising, public health, and natural language processing. Despite these advances, a substantial gap remains between RL research and its deployment in many practical settings. Two recurring challenges often underlie this gap. First, many settings offer limited opportunity for the agent to interact extensively with the target environment due to practical constraints. Second, many target environments often undergo substantial changes, requiring redesign and redeployment of RL systems (e.g., advancements in science and technology that change the landscape of healthcare delivery). Addressing these challenges and bridging the gap between basic research and application requires theory and methodology that directly inform the design,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference · Mobile Crowdsensing and Crowdsourcing