Corruption-Robust Offline Reinforcement Learning with General Function   Approximation

Chenlu Ye; Rui Yang; Quanquan Gu; Tong Zhang

arXiv:2310.14550·cs.LG·February 20, 2024·1 cites

Corruption-Robust Offline Reinforcement Learning with General Function Approximation

Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new offline reinforcement learning algorithm that is robust to data corruption, providing theoretical guarantees that match lower bounds in linear MDP settings.

Contribution

It proposes a corruption-robust offline RL algorithm using an uncertainty-weighting technique, with tight bounds in linear MDPs, addressing adversarial data corruption.

Findings

01

Achieves suboptimality bounds with additive corruption-dependent terms.

02

Reduces to known bounds in linear MDPs, matching lower bounds.

03

Provides a new method for corruption-robust offline RL with theoretical guarantees.

Abstract

We investigate the problem of corruption robustness in offline reinforcement learning (RL) with general function approximation, where an adversary can corrupt each sample in the offline dataset, and the corruption level $ζ \geq 0$ quantifies the cumulative corruption amount over $n$ episodes and $H$ steps. Our goal is to find a policy that is robust to such corruption and minimizes the suboptimality gap with respect to the optimal policy for the uncorrupted Markov decision processes (MDPs). Drawing inspiration from the uncertainty-weighting technique from the robust online RL setting \citep{he2022nearly,ye2022corruptionrobust}, we design a new uncertainty weight iteration procedure to efficiently compute on batched samples and propose a corruption-robust algorithm for offline RL. Notably, under the assumption of single policy coverage and the knowledge of $ζ$ , our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yangrui2015/uwmsg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques · Advanced Causal Inference Techniques