Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Zhishuai Liu; Pan Xu

arXiv:2403.09621·cs.LG·November 3, 2025·1 cites

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Zhishuai Liu, Pan Xu

PDF

Open Access 1 Video

TL;DR

This paper develops minimax optimal, computationally efficient algorithms for distributionally robust offline reinforcement learning with linear models, addressing challenges of nonlinearity and uncertainty in dynamics.

Contribution

It introduces novel algorithms and theoretical analysis for robust offline RL with linear models, incorporating variance-based function approximation and instance-dependent suboptimality analysis.

Findings

01

Algorithms are minimax optimal and computationally efficient.

02

Function approximation in robust offline RL is more complex than in standard RL.

03

Theoretical insights include variance-informed approximation and suboptimality bounds.

Abstract

Distributionally robust offline reinforcement learning (RL), which seeks robust policy training against environment perturbation by modeling dynamics uncertainty, calls for function approximations when facing large state-action spaces. However, the consideration of dynamics uncertainty introduces essential nonlinearity and computational burden, posing unique challenges for analyzing and practically employing function approximation. Focusing on a basic setting where the nominal model and perturbed models are linearly parameterized, we propose minimax optimal and computationally efficient algorithms realizing function approximation and initiate the study on instance-dependent suboptimality analysis in the context of robust offline RL. Our results uncover that function approximation in robust offline RL is essentially distinct from and probably harder than that in standard offline RL. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Traffic control and management