Statistical Reinforcement Learning in the Real World: A Survey of Challenges and Future Directions
Asim H. Gazi, Yongyi Guo, Daiqi Gao, Ziping Xu, Kelly W. Zhang, Susan A. Murphy

TL;DR
This survey reviews recent advances in statistical reinforcement learning addressing real-world challenges like limited interactions and environment changes, emphasizing methods for data efficiency, continual improvement, and future research directions.
Contribution
It provides a comprehensive overview of recent statistical RL methods tailored for practical deployment challenges and outlines future research directions for impactful real-world applications.
Findings
Methods for maximizing data utility in offline analysis
Techniques for enhancing sample efficiency online
Strategies for designing deployment sequences for continual improvement
Abstract
Reinforcement learning (RL) has achieved remarkable success in real-world decision-making across diverse domains, including gaming, robotics, online advertising, public health, and natural language processing. Despite these advances, a substantial gap remains between RL research and its deployment in many practical settings. Two recurring challenges often underlie this gap. First, many settings offer limited opportunity for the agent to interact extensively with the target environment due to practical constraints. Second, many target environments often undergo substantial changes, requiring redesign and redeployment of RL systems (e.g., advancements in science and technology that change the landscape of healthcare delivery). Addressing these challenges and bridging the gap between basic research and application requires theory and methodology that directly inform the design,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference · Mobile Crowdsensing and Crowdsourcing
