Statistical and Algorithmic Foundations of Reinforcement Learning

Yuejie Chi; Yuxin Chen; Yuting Wei

arXiv:2507.14444·stat.ML·July 22, 2025

Statistical and Algorithmic Foundations of Reinforcement Learning

Yuejie Chi, Yuxin Chen, Yuting Wei

PDF

TL;DR

This paper reviews recent theoretical and algorithmic advances in reinforcement learning, focusing on sample efficiency, computational challenges, and various RL scenarios using Markov Decision Processes.

Contribution

It synthesizes key developments in RL theory and algorithms, connecting classical ideas with new approaches across multiple RL settings.

Findings

01

Highlights the importance of sample complexity and computational efficiency in RL.

02

Discusses lower bounds and theoretical limits of RL algorithms.

03

Examines different RL scenarios including offline, online, and robust RL.

Abstract

As a paradigm for sequential decision making in unknown environments, reinforcement learning (RL) has received a flurry of attention in recent years. However, the explosion of model complexity in emerging applications and the presence of nonconvexity exacerbate the challenge of achieving efficient RL in sample-starved situations, where data collection is expensive, time-consuming, or even high-stakes (e.g., in clinical trials, autonomous systems, and online advertising). How to understand and enhance the sample and computational efficacies of RL algorithms is thus of great interest. In this tutorial, we aim to introduce several important algorithmic and theoretical developments in RL, highlighting the connections between new ideas and classical topics. Employing Markov Decision Processes as the central mathematical model, we cover several distinctive RL scenarios (i.e., RL with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.